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BACKGROUND OF THE INVENT I QM 
This is a continuation -in-part application of U.S. Patent 
Application Serial No. 699,181, filed February 5, 1985. 
Endogenous proteolytic enzymes serve to degrade invading organ- 
isms, antigen-antibody complexes and certain tissue proteins 
which are no longer necessary or useful to the organism. In a 
normally functioning organism, proteolytic enzymes are produced 
in a limited quantity and are regulated in part through specific 
inhibitors . 

Metalloproteinases are enzymes present in the body which are 
often involved in the degradation of connective tissue. While 
some connective tissue degradation is necessary for normal func- 
tioning of an organism, an excess of connective tissue degrada- 
tion occurs in several disease states and is believed to be at- 
tributable, at least in part, to excess metalloproteinase . It is 
believed that metalloproteinases are at least implicated in peri- 
odontal disease, corneal and skin ulcers, rheumatoid arthritis 
and the spread of cancerous solid tumors. 

These diseases generally occur in areas of the body which 
contain a high proportion of collagen, a particular form of con- 
nective tissue. An examination of patients with these diseases 
of connective tissue has revealed an excessive breakdown of the 
various components of connective tissues, including collagen 
proteoglycans and elastin. Therefore, it has been deduced that 
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an excessive concentration of a particular metalloproteinase , fo 
example* collagenase, proteoglyconuse, gelatinase, and certain 

j elastases, may cause or exacerbate the connective tissue destruc- 

|l 

;| tion associated with the aforementioned diseases. 

\ In the normal state, the body possesses metalloproteinase 

; i 

inhibitors which bind to metalloproteinases to effectively pre- 
vent these enzymes from acting on their connective tissue sub- 
,; strates. Specifically, in- a healthy organism, metalloproteinase 
inhibitors are present in concentrations sufficient to interact 
with metalloproteinases to an -extent which allows sufficient 
quantities of metalloproteinase to remain active while binding 
the excess metalloproteinase so that the connective tissue damag* 
seen in the various diseases does not occur. 

It is postulated that one immediate cause of the connective 
tissue destruction present in the foregoing disease states is an 
imbalance in the relative metalloproteinase/metalloproteinase in- 
hibitor concentrations. In these situations, either due to an 
excessive amount of active metalloproteinase or a deficiency in 
the amount of active metalloproteinase inhibitor, the excess met- 
alloproteinase is believed to cause the connective tissue degra- 
dation responsible for causing or exacerbating the disease. It 
is postulated that, by treating persons with connective tissue 
diseases with metalloproteinase inhibitors, the degradative 
action of the excess metalloproteinase may be curtailed or 
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' halted. Therefore, particular metalloproteinase inhibitors of 
'specific interest to the present inventors are collagenase inhib- 
, itors because it is believed that these inhibitors would be phar- 
1 maceutically useful in the treatment or prevention of connective 

tissue diseases. 

The existence of metalloproteinase and metalloproteinase in- 
hibitors has been discussed in the scientific literature. For 
example, Sellers et al. , Biochemical And Bio physical Research 
communications 87:581-587 (1979), discusses isolation of rabbit 
bone collagenase inhibitor. Collagenase inhibitor isolated from 
human skin fibroblasts is discussed in Stricklin and Welgus, 
J. B.C. 258:12252-12258 (1983) and Welgus and Stricklin, J. B.C. 
258:12259-12264 -(1983). The presence of collagenase inhibitors 
in naturally-occurring body fluids is further discussed in Murphy 
et al. , Biochem. J. 195:167-170 (1981) and Cawston et al. , 
Arthritis and Rheumatism, 27:285 (1984). In addition, metallo- 
proteinase inhibitors are discussed by Reynolds et al. in 
Cellular Interactions , Dingle and Gordon, eds., (1981). Although 
these articles characterize particular, isolated metallopro-. 
teinase inhibitors and discuss, to some extent, the role or 
potential role of metalloproteinases in connective tissue disease 
treatment and speculate on the ability of metalloproteinase 
inhibitors to counteract this destruction, none of these re- 
searchers had previously been able to isolate a portable DNA 



law orncts — 3 — 

flnnecan. henderson 
Farabow. Garrett 

8 DUNNER. 

I77S « STHCCT N.W. 
WASHINGTON. O C 20006 



i ' 

! 

j 

sequence capable of directing intracellular production of metal- 
| loproteinase inhibitors or to create a recombinant-DNA method fc 
j } the production of these inhibitors. 

l| Surprisingly, the present inventors have discovered a porta- 
ble DNA sequence capable of directing the recombinant-DNA synthe- 
sis of metalloproteinase inhibitors* These metalloproteinase in- 
hibitors are biologically equivalent to those isolated from humar 
skin fibroblast cultures. The metalloproteinase inhibitors of 
jthe present invention, prepared by the recombinant-DNA methods 
set forth herein, vill enable increased research into prevention 
and treatment of metalloproteinase-induced connective tissue dis- 
eases. In addition, the metalloproteinase inhibitors of the 
present invention are useful in neutralizing metalloproleinases , 
f{ including the excess metalloproteinase associated with disease 

Ml states. Therefore, it is believed that a cure for these diseases 

%i will be developed which will embody, as an active ingredient, the 

metalloproteinase inhibitors of the present invention. Further- 
M; more, the metalloproteinase inhibitors of the present invention 

jk are capable of interacting with their metalloproteinase targets 

in a manner which allows the development of diagnostic tests for 
degradative connective tissue diseases using the newly discovered 
inhibitors . 

The recombinant metalloproteinase inhibitors discussed here- 
in interact stoichiometrically (i-e., in a 1:1 ratio) with their 
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Ilmetalloproteinase targets* In addition, these metalloproteinase 

'Lnhi^tors are heat resistant, acid stable, glycosylated, and ex- 

\ j 

ijhibit a high isoelectric point. 

i I 

:| SUMMARY OF THE INVENTION 

The present invention relates to metalloproteinase inhib- 

l r 

; 4tors and a recombinant-DNA method of producing the same and to 
portable DNA sequences capable of directing intracellular produc- 
tion of the metalloproteinase inhibitors. Particularly, the 
present invention relates to a collagenase inhibitor, a recombi- 
nant-DNA method for producing the same and to portable DNA se- 
quences for use in the recombinant method. The present invention 
also relates to a series of vectors containing these portable DNA 
sequences . 

One object of the present invention is to provide a metallo- 
proteinase inhibitor, which can be produced in sufficient quan- 
tities and purities to provide economical pharmaceutical composi- 
tions which possess metalloproteinase inhibitor activity. 

An additional object of the present invention is to provide 
a recombinant-DNA method for the production of these metallopro- 
teinase inhibitors. The recombinant metalloproteinase inhibitors 
produced by this method are biologically equivalent to the metal- 
loproteinase inhibitor isolable from human skin fibroblast cul- 
tures. 
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To facilitate the recombinant-DNA synthesis of these metal- 
loproteinase inhibitors, it is a further object of the present 
invention to provide portable DNA sequences capable of directing 
intracellular production of metalloproteinase inhibitors. It is 
also an object of the present invention to provide cloning 
vectors containing these portable sequences. These vectors are 
lt capable of being used in recombinant systems to produce pharma- 

iiceutically useful quantities of metalloproteinase inhibitors. 

: i 

'! Additional objects and advantages of the invention will be 

jset forth in part in the description which follows, and in r>art 

'Iwill be obvious from the description or may be learned from prac- 

i , 
"tice of the invention. The objects and advantages may be real- 
ized and attained by means of tne instrumentalities and combina- 
tions particularly pointed out in the appended claims. 

To achieve the objects and in accordance with the purposes 
of the present invention, metalloproteinase inhibitors are set 
forth, which are capable of stoichiometric reaction with metalio- 
proteinases. These metalloproteinase inhibitors are remarkably 
heat resistant, acid stable, glycosylated, and exhibit a high 
isoelectric point. Furthermore, these metalloproteinase inhib- 
itors are biologically equivalent to those inhibitors isolated 
'from human skin fibroblast cultures. 

To further achieve the objects and in accordance with the 
purposes of the present invention, as embodied and broadly 
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described herein, portable DNA sequences coding for metalloprc- 
teinase inhibitors are provided. These sequences comprise nucle- 
otide sequences capable of directing intracellular production of 
metalloproteinase inhibitors. The portable sequences may be ei- 
ther synthetic sequences or restriction fragments ("natural" DNA 
sequences). In a preferred embodiment, a portable DNA sequence 
is isolated from a human fibroblast cDNA library and is capable 
! 'of directing intracellular production of a collagenase inhibitor 
| which is biologically equivalent to that inhibitor which is 
,'' isolable from a human skin fibroblast culture. 

: The coding strand of a first preferred DNA sequence which 

i 

, : has been discovered has the following nucleotide sequence: 



10 20 30 40 50 60 

GTTGTTGCTG TGGCTGATAG CCCCAGCAGG GCCTGCACCT GTGTCCCACC CCACCCACAG 

70 80 90 100 110 120 

ACGGCCTTCT GCAATTCCGA CCTCGTCATC AGGGCCAAGT TCGTGGGGAC ACCAGAAGTC 

130 140 150 160 170 180 

AACCAGACCA CCTTATACCA GCGTTATGAG ATCAAGATGA CCAAGATGTA TAAAGGGTTC 

190 200 210 220 230 240 

CAAGCCTTAG GGGATGCCGC TGACATCCGG TTCGTCTACA CCCCCGCCAT GGAGAGTGTC 

250 260 270 280 290 300 

TGCGGATACT TCCACAGGTC CCACAACCGC AGCGAGGAGT TTCTCATTGC TGGAAAACTG 

310 320 330 340 350 360 

CAGGATGGAC TCTTGCACAT CACTACCTGC AGTTTCGTGG CTCCCTGGAA CAGCCTGAGC 

370 380 390 400 410 420 

1 TTAGCTCAGC GCCGGGGCTT CACCAAGACC TACACTGTTG GCTGTGAGGA ATGCACAGTL 

430 440 450 460 470 480 

' TTTCCCTGTT TATCCATCCC CTGCAAACTG CAGAGTGGCA CTCATTGCTT GTGGACGGAC 
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490 500 ^ n 520 530 54 

CAGCTCCTCC AAGGCTCTGA AAAGGGCTTC CAGTCCCGTC ACCTTGCCTG CCTGCCTCG 

550 560 570 580 590 6C 

GAGCCAGGGC TGTGCACCTG GCAGTCCCTG CGGTCCCAGA TAGCCTGAAT CCTGCCCGG 

610 620 630 640 650 66 

■' GTGGAAGCTG AAGCCTGCAC AGTGTCCACC CTGTTCCCAC TCCCATCTTT CTTCCGGAC 

670 680 690 700 

ATGAAATAAA GAGTTACCAC CCAGCAAAAA AAAAAAGGAA TTC 

The nucleotides represented by the foregoing abbreviations are 

set forth in the Detailed Description of the Preferred Embodi- 

ijments. 

: i 

i A second preferrred DNA sequence has been discovered which 
has an additional nucleotide sequence 5' to the initiator se- 
quence. This sequence, which contains as the eighty-second 

[ through four-hundred-thirty-second nucleotides nucleotoides 1 

through 351 of the first preferred sequence set forth above, hai 

the following nucleotide sequence: 

10 20 30 40 50 £ 

GGCCATCGCC GCAGATCCAG CGCCCAGAGA GACACCAGAG AACCCACCAT GGCCCCCT 

70 80 90 100 110 1 

GACCCCTGGC TTCTGCATCC TGTTGTTGCT GTGGCTGATA GCCCCAGCAG GGCCTGCA( 

130 140 150 * 160 170 1 

TGTGTCCCAC CCCACCCACA GACGGCCTTC TGCAATTCCG ACCTCGTCAT CAGGGCCA. 

190 200 210 220 230 2 

, TTCGTGGGGA CACCAGAAGT CAACCAGACC ACCTTATACC AGCGTTATGA GATCAAGA 

250 260 270 280 290 3 

IACCAAGATGT ATAAAGGGTT CCAAGCCTTA GGGGATGCCG CTGACATCCG GTTCGTCT 

310 320 330 340 350 3 

ACCCCCGCCA TGGAGAGTGT CTGCGGATAC TTCCACAGGT CCCACAACCG CAGCGAGG 
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370 380 390 400 410 

TTTCTCATTG CTGGAAAACT GCAGGATGGA CTCTTGCACA TCACTACCTG CAGT7TC 

430 

GCTCCCTGGA AC 

A third preferred DNA sequence which incorporates the 5' re 

jlgion of the second preferred sequence and the 3' sequence of the 

i! first preferred sequence, has the following nucleotide sequence: 

10 20 30 40 50 6 

GGCCATCGCC GCAGATCCAG CGCCCAGAGA GACACCAGAG AACCCACCAT GGCCCCCTT 

70 80 90 100 110 12 

GACCCCTGGC TTCTGCATCC TGTTGTTGCT GTGGCTGATA GCCCCAGCAG GGCCTGCAC 

!30 140 150 160 170 18 

TGTGTCCCAC CCCACCCACA GACGGCCTTC TGCAATTCCG ACCTCGTCAT CAGGGCCAA 

\ iso 200 210 220 230 24 

TTCGTGGGGA CACCAGAAGT CAACCAGACC ACCTTATACC AGCGTTATGA GATCAAGAT 

250 260 270 280 290 30 

ACCAAGATGT ATAAAGGGTT CCAAGCCTTA GGGGATGCCG CTGACATCCG GTTCGTCTA 

310 320 330 340 350 36 

ACCCCCGCCA TGGAGAGTGT CTGCGGATAC TTCCACAGGT CCCACAACCG CAGCGAGGA 

370 380 390 400 410 42 

TTTCTCATTG CTGGAAAACT GCAGGATGGA CTCTTGCACA TCACTACCTG CAGTTTCGT 

430 440 450 460 470 48 

GCTCCCTGGA ACAGCCTGAG CTTAGCTCAG CGCCGGGGCT TCACCAAGAC CTACACTGT 

490 500 510 520 530 54 

GGCTGTGAGG AATGCACAGT GTTTCCCTGT TTATCCATCC CCTGCAAACT GCAGAGTGG 

j 550 560 570 580 590 6C 

lACTCATTGCT TGTGGACGGA CCAGCTCCTC CAAGGCTCTG AAAAGGGCTT CCAGTCCCC 

i 

! 610 620 630 640 650 66 

CACCTTGCCT GCCTGCCTCG GGAGCCAGGG CTGTGCACCT GGCAGTCCCT GCGGTCCCJ 

670 680 690 700 710 72 

ATAGCCTGAA TCCTGCCCGG AGTGGAAGCT GAAGCCTGCA CAGTGTCCAC CCTGTTCCC 
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730 740 750 760 770 780 

ZTCCCATCTT TCTTCCGGAC AATGAAATAA AGAGTTACCA CCCAGCAAAA AAAAAAAGGA 
To facilitate identification and isolation of natural DNA 

sequences for use in the present invention, the inventors have 

developed a human skin fibroblast cDNA library. This library 

1 1 
t 

Ijcontains the genetic information capable of directing a cell to 
^synthesize the metalloproteinase inhibitors of the present inven- 

if 

tion. Other natural DNA sequences which may be used in the 

I; 

iirecombinant DNA methods set forth herein may be isolated from 

•I 

'human genomic libraries. 

■| Additionally, portable DNA sequences useful in the processes 
of the present invention may be synthetically created. These 

.synthetic DNA sequences may be prepared by polynucleotide synthe- 
sis and sequencing techniques known to those of ordinary skill in 
the art. 

Additionally, to achieve the objects and in accordance with 
the purposes of the present invention, a recombinant-DNA method 
is disclosed which results in microbial manufacture of the in- 
stant metalloproteinase inhibitors using the portable DNA se- 
quences referred to above. This recombinant DNA method com- 
prises: 

': (a) preparation of a portable DNA sequence capable of 

' i 

M directing a host microorganism to produce a protein 

having metalloproteinase inhibitor activity, preferably 
collagenase inhibitor activity; 
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(b) cloning the portable DMA sequence into a vectd capable 
of being transferred into and replicating in a host mi- 
croorganism, such vector containing operational ele- 
ments for the portable DNA sequence; 

(c) transferring the vector containing the portable DNA se- 
quence and operational elements into a host microorga- 
nism capable of expressing the metalloproteinase inhib- 
itor protein; 

(d) culturing the host microorganism under conditions 
appropriate for amplification of the vector and expres- 
sion of the inhibitor; and 

i 

(e) in either order: 

(i) harvesting the inhibitor; and 

(ii) causing the inhibitor to assume an active, terti- 
ary structure whereby it possesses metallopro- 
teinase inhibitor activity. 

To further accomplish the objects and in further accord with 
the purposes of the present invention, a series of cloning vec- 
tors are provided comprising at least one of the portable DNA se- 
quences discussed above. In particular, plasmid pUC9-F5/237P10 
j is disclosed. 

It is understood that both the foregoing general description 
and the following detailed description are exemplary and explana- 
tory only and are not restrictive of the invention, as claimed. 
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The accompanying drawing, which is incorporated in and con- 
stitutes a part of this specification, illustrates one embodimen 
of the invention and, together with the description, serves to 
explain the principles of the invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 is a partial restriction map of the plasmid 
pUC9-F5/237P10. 

; DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Reference will now be made in detail to the presently pre- 
ferred embodiments of the invention, which, together with the 
drawing and the following examples, serve to explain the princi- 
ples of the invention. 

As noted above, the present invention relates in part to 
portable DNA sequences capable of directing intracellular produc- 
tion of metalloproteinase inhibitors in a variety of host micro- 
organisms. "Portable DNA sequence" in this context is intended 
to refer either to a synthetically-produced nucleotide sequence 
or to a restriction fragment of a naturally occuring DNA se- 
quence. For purposes of this specification, "metalloproteinase 
inhibitor" is intended to mean the primary structure of the pro- 
tein as defined by the codons present in the deoxyribonucleic 
acid sequence which directs intracellular production of the amine 
acid sequence, and which may or may not include post-transla- 
tional modifications. It is contemplated that such 
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post-translat ional modifications include, for example, 
glycosylat ion. It is further intended that the term "metallopro- 

i 

j teinase inhibitor" refer to either the form of the protein as 
; ! would be excreted from a microorganism or the methionyl-metallo- 

; proteinase inhibitor as it may be present in microorganisms from 

"I 

i| which it was not excreted, 

Ti 

: | In a preferred embodiment, the portable DNA sequences are 

' capable of directing intracellular production of collagenase in- 
hibitors. In a particularly preferred embodiment, the portable 
DNA sequences are capable of directing intracellular production 
\ of a collagenase inhibitor biologically equivalent to that previ- 
ously isolated from human skin fibroblast cultures. By "biologi- 
cally equivalent" , as used herein in the specification and 
claims, it is meant that an inhibitor, produced using a portable 
DNA sequence of the present invention, is capable of preventing 
collagenase- induced tissue damage of the same type, but not nec- 
essarily to the same degree, as a native human collagenase inhib- 
itor, specifically that native human collagenase inhibitor 
isolable from human skin fibroblast cell cultures. 

A first preferred portable DNA sequence of the present 
invention has a nucleotide sequence as follows: 
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10 20 30 40 50 60 

GTTGTTGCTG TGGCTGATAG CCCCAGCAGG GCCTGCACCT GTGTCCCACC CCACCCACAG 

70 80 90 100 HO 120 

! ACGGCCTTCT GCAATTCCGA CCTCGTCATC AGGGCCAAGT TCGTGGGGAC ACCAGAAGTC 

|| 13Q 140 150 160 170 180 

! AACCAGACCA CCTTATACCA GCGTTATGAG ATCAAGATGA CCAAGATGTA TAAAGGGTTC 

■ igo 200 210 220 230 240 

! CAAGCCTTAG GGGATGCCGC TGACATCCGG TTCGTCTACA CCCCCGCCAT GGAGAGTGTC 

.! 250 260 270 280 290 300 

! TGCGGATACT TCCACAGGTC CCACAACCGC AGCGAGGAGT TTCTCATTGC TGGAAAACTG 

i 310 320 330 340 350 360 

CAGGATGGAC TCTTGCACAT CACTACCTGC AGTTTCGTGG CTCCCTGGAA CAGCCTGAGC 

370 380 390 400 410 420 

TTAGCTCAGC GCCGGGGCTT CACCAAGACC TACACTGTTG GCTGTGAGGA ATGCACAGTG 

430 440 450 460 470 480 

TTTCCCTGTT TATCCATCCC CTGCAAACTG CAGAGTGGCA CTCATTGCTT GTGGACGGAC 

490 500 510 520 , 530 540 

CAGCTCCTCC AAGGCTCTGA AAAGGGCTTC CAGTCCCGTC ACCTTGCCTG CCTGCCTCGG 

550 560 570 580 590 600 

GAGCCAGGGC TGTGCACCTG GCAGTCCCTG CGGTCCCAGA TAGCCTGAAT CCTGCCCGGA 

610 620 630 640 650 660 

GTGGAAGCTG AAGCCTGCAC AGTGTCCACC CTGTTCCCAC TCCCATCTTT CTTCCGGACA 

670 680 690 700 

ATGAAATAAA GAGTTACCAC CCAGCAAAAA AAAAAAGGAA TTC 

wherein the following nucleotides are represented by the abbrevi- 



ations indicated below. 
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Nucleotides Abbreviation 
Ueoxyadenylic acid A 
Deoxyguanylic acid G 
Deoxycytidylic acid C 
Thymidylic acid T 

A second preferred portable DNA sequence of the present 

invention has the following nucleotide sequence: 

10 20 30 

GGCCATCGCC GCAGATCCAG CGCCCAGAGA 

70 ,80 , , 90 

GACCCCTGG'C T^CTGCATCC TGTTGTTGCT 

130 140 150 

TGTGTCCCA'C CCCACCCACA GACGGCCTTC 

190 200 210 

TTCGTGGGGA CACCAGAAGT CAACCAGACC 

250 260 270 

ACCAAGATGT ATAAAGGGTT CCAAGCCTTA 

310 320 330 

ACCCCCGCCA TGGAGAGTGT CTGCGGATAC 

370 380 390 

TTTCTCATTG CTGGAAAA CT GCAGGATGGA 

430 

GCTCCCTGGA AC 

In this second preferred sequence, an open reading frame exists 
from nucleotides 1 through 432. The first methionine of this 
reading frame is encoded by nucleotides by 49 through 51 and is 
the site of translation initiation. It should be noted that the 
amino acid sequence prescribed nucleotides 49 through 114 is not 



40 


50 


60 


GACACCAGAG 


AACCCACCAT 


GGCCtCCTTT 


100 


110 


120 


GTGGCTGATA 


gccccagcXg 


GGCCTGCACC 


160 


170 


180 


TGCAATTCCG 


ACCTCGTCAT 


CAGGGCCAAG 


220 


230 


240 


ACCTTATACC 


AGCGTTATGA 


GATCAAGATG 


280 


290 


30C 


GGGGATGCCG 


CTGACATCCG 


GTTCGTCTAC 


340 


350 


360 


TTCCACAGGT 


CCCACAACCG 


CAGCGAGGAG 


400 


410 


42C 


CTCTTGCACA 


TCACTACCTG 


CAGTTTCGTC 
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1 found in the mature metalloproteinase. It is believed that this 

i 

u 

>! sequence is the leader peptide of the human protein. 

i 

j A third preferred portable DNA sequence has the mucleotide 

i| 

! J sequence: 





1U 




1(\ 
i<J 










\ GGCCATCGCC 


GCAGATCCAG 


CGCCCAGAGA 


GACACCAGAG 


AACCCACCAT 


GGCCCCCTTT 




/ U 


an 


90 


100 




120 




GACCCCTGGC 


TTCTGCATCC 


TGTTGTTGCT 


GTGGCTGATA 


GCCCCAGCAG 


GGCCTGCACC 






140 


i ^n 


i fin 


170 


180 

1 U U 




: TGTGTCCCAC 


CCCACCCACA 


GACGGCCTTC 


TGCAATTCCG 


ACCTCGTCAT 


CAGGGCCAAG 




i on 


c u u 


?i n 


220 


230 






TTCGTGGGGA 


CACCAGAAGT 


CAACCAGACC 


ACCTTATACC 


AGCGTTATGA 


GATCAAGATG 




TEH 








290 


300 

J v w 




ACCAAGATGT 


ATAAAGGGTT 


CCAAGCCTTA 


GGGGATGCCG 


CTGACATCCG 


GTTCGTCTAC 




310 


"3 *5 fi 






7 ^n 






ACCCCCGCCA 


TGGAGAGTGT 


CTGCGGATAC 


TTCCACAGGT 


CCCACAACCG 


CAGCGAGGAG 






jSU 




Ann 


a i n 


a? n 




TTTCTCATTG 


CTGGAAAACT 


GCAGGATGGA 


CTCTTGCACA 


TCACTACCTG 


CAGTTTCGTG 




4 3Q 






a c n 


a 7 n 


& r n 

t y u 


ui 


GCTCCCTGGA 


ACAGCCTGAG 


CTTAGCTCAG 


CGCCGGGGCT 


TCACCAAGAC 


CTACACTGTT 




490 


C Pt 

3 U U 


31 u 


t; "5 n 


530 






GGCTGTGAGG 


AATGCACAGT 


GTTTCCCTGT 


TTATCCATCC 


CCTGCAAACT 


GCAGAGTGGC 




550 


560 


570 


580 


590 


600 




ACTCATTGCT 


TGTGGACGGA 


CCAGCTCCTC 


CAAGGCTCTG 


AAAAGGGCTT 


CCAGTCCCCT 


0 


610 


620 


630 


640 


650 


660 




CACCTTGCCT 


GCCTGCCTCG 


GGAGCCAGGG 


CTGTGCACCT 


GGCAGTCCCT 


GCGGTCCCAG 




670 


680 


690 


700 


710 


720 




ATAGCCTGAA 


TCCTGCCCGG 


AGTGGAAGCT 


GAAGCCTGCA 


CAGTGTCCAC 


CCTGTTCCC/ 




730 


740 


750 


760 


770 


78C 




CTCCCATCTT 


TCTTCCGGAC 


AATGAAATAA 


AGAGTTACCA 


CCCAGCAAAA 


AAAAAAAGG/ 



lam ornccs 

FlNNECAN. HEW0ER5ON 

r arabosj/. Garrett 
S Dunner 

(773 « ST«£CT.N,W. 
WASM»MSTQN.O. C.2000* 



m This third sequence contains the 5 1 nontranslated region of the 
\ second preferred sequence and the 3' region of the first pre- 
•\ ferred sequence. It is envisioned that this third preferred se- 
'! quence is capable of directing intracellular production of a met- 
alloproteinase analogous to a mature human collagenase inhibitor 
; in a microbial or mammalian expression system. 

It must be borne in mind in the practice of the present 
invention that the alteration of some amino acids in a protein 
'sequence may not affect the fundamental properties of the pro- 
tein. Therefore, it is also contemplated that other portable DN. 
sequences, both those capable of directing intracellular produc- 
tion of identical amino acid sequences and those capable of 
directing intracellular production of analogous amino acid se- 
quences which also possess metalloproteinase inhibitor activity, 
are included within the ambit of the present invention. 

It is contemplated that some of these analogous amino acid 
sequences will be substantially homologous to native human 
metalloproteinase inhibitors while other amino acid sequences, 
capable of functioning as metalloproteinase inhibitors, will no^ 
exhibit substantial homology to native inhibitors. By "substan- 
tial homology", as used herein, is meant a degree of homology to 
a native metalloproteinase inhibitor in excess of 50%, preferabl 
in excess of 60%, preferably in excess of 80%. The percentage 
homology as discussed herein is calculated as the percentage of 
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amino acid rescues found in the smaller of the two sequences 
that aHgn vitn identical amino acid residues in the sequence 
being compared when four gaps in a length of 100 amino acids may 
: be introduced to assist in that alignment as set forth by 

, Dayhoff , M.O. in Atlas of Protein Sequence and Structure Vol. 5, 

i 

p. 124 (1972), National Biochemical Research Foundation, 
Washington, D.C. , specifically incorporated herein by reference, 
i As noted above, the portable DNA sequences of the present 
•invention may be synthetically created. It is believed that the 
means for synthetic creation of these polynucleotide sequences 
are generally known to one of ordinary skill in the art, particu- 
larly in light of the teachings contained herein. As an example 
of the current state of the art relating to polynucleotide syn- 
thesis, one is directed to Matteucci, M.D. and Caruthers, M.H., 
in J. Am. Chem. Soc. 103: 3185 (1981) and Beaucage, S.L. and 
Caruthers, M.H. in Tetrahedron Lett. 22: 1859 (1981), specifical- 
ly incorporated herein by reference. 

Additionally, the portable DNA sequence may be a fragment of 
a natural sequence, i.e., a fragment of a polynucleotide which 
occurred in nature and which has been isolated and purified for 
the first time by the present inventors. In one embodiment, the 
portable DNA sequence is a restriction fragment isolated from a 
cDNA library. In this preferred embodiment, the cDNA library is 
created from human skin fibroblasts. 
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In an alternative embodiment, the portable DNA sequence is 
isolated from a human yenomic library. An example of such a li- 
brary useful in this embodiment is set forth in Lawn et al. Cell 
15: 1157-1174 (1978) , specifically incorporated herein by refer 
j ence. 

i 

I As also noted above, the present invention relates to a se- 

| 

I ries of vectors, each containing at least one of the portable DN, 
j sequences described herein. It is contemplated that additional 
; copies of the portable DNA sequence may be included in a single 
vector to increase a host microorganism's ability to produce 
large quantities of the desired metalloproteinase inhibitor. 

In addition, the cloning vectors* within the scope of the 
present invention may contain supplemental nucleotide sequences 
: preceding or subsequent to the portable DNA sequence. These sup* 
plemental sequences are those that will not interfere with tran^ 
scription of the portable DNA sequence and will, in some in- 
stances as set forth more fully hereinbelow, enhance 
transcription, translation, or the ability of the primary amino 
acid structure of the resultant metalloproteinase inhibitor to 
assume an active, tertiary form. 

A preferred vector of the present invention is set forth in 
Figure 1. This vector, pUC9-F5/237P10 r contains the preferred 
nucleotide sequence set forth above. Vector pUC9-F5/237P10 is - 
present in the C600/pUC9-F5/237P10 cells on deposit in the 
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: ( American Type Culture Collection in Rockville, Maryland under 

, i 

Accession No, 53003. 
;! A preferred nucleotide sequence encoding the metallopro- 

1 teinase inhibitor is identified in Figure 1 as region A. Plasmid 
pUC9-F5/237P10 also contains supplemental nucleotide sequences 
preceding and subsequent to the preferred portable DNA sequence 
in region A. These supplemental sequences are identified as re- 
.gions B and C, respectively. 

In alternate preferred embodiments, either one or both of 
the preceding or subsequent supplemental sequences may be removed 
from the vector of Fig. 1 by treatment of the vector with re- 
% strict ion endonucleases appropriate for removal of the supplemen- 

^ tal sequences. The supplemental sequence subsequent to the por- 

table DNA sequence, identified in Fig. 1 as region C, may be 
removed by treatment of the vector with a suitable restriction 
* endonuclease, preferably HqiAI followed by reconstruction of the 

M= 3 r end of region A using synthetic oligonucleotides and ligation 

J; of the vector with T-4 DNA ligase. Deletion of the supplemental 

U sequence preceding the portable DNA sequence, identified as re- 

gion B in Fig. 1, would be specifically accomplished by the meth- 
od set forth in Example 2. 

In preferred embodiments, cloning vectors containing and ca- 
pable of expressing the portable DNA sequence of the present 
invention contain various operational elements. These 
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i "operational elements," as discussed herein, include at least one 

i 

' promoter, at least one Shine-Dalgarno sequence, at least one 

i 

I terminator codon. Preferably, these "operational elements" also 

i 

!j include at least one operator, at least one leader sequence, and 
. for proteins to be exported from intracellular space, at least 
■ one regulator and any other DNA sequences necessary or preferred 
' for appropriate transcription and subsequent translation of the 
vector DNA. 

Additional embodiments of the present invention are envi- 
sioned as employing other known or currently undiscovered vectors 
which would contain one or more of the portable DNA sequences de- 
scribed herein. In particular, it is preferred that these 
vectors have some or all of the following characteristics: (1) 
possess a minimal number of host-organism sequences; (2) be sta- 
ble in the desired host; (3) be capable of being present in a 
high copy number in the desired host; (4) possess a regulatable 
promoter; (5) have at least one DNA sequence coding for a se- 
lectable trait present on a portion of the plasmid separate from 
that where the portable DNA sequence will be inserted; and (6) b< 
integrated into the vector. 

The following, noninclus ive , list of cloning vectors is be- 
lieved to set forth vectors which can easily be altered to meet 
the above-criteria and are therefore preferred for use in the 
present invention. Such alterations are easily performed by 
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those of ordinary skill in the art in light of the available lit- 
erature and the teachings herein. 

TABLE I 



HOST 
E. coli 



BACILLUS 

B. subtil is 

B. amyloliauef aciens 

B. stearothermophilus 



PSEUDOMONAS 
P. aeruginosa 
p. putida 



CLOSTRIDIUM 
C. perfrinqens 



SACCHAROMYCES 
S . cerevis iae 



Vectors 

pUC8 

pUC9 

pBR322 

pGW7 

placid 

pDP8 

pUBHO 

pSA0501 

pSA2100 

pBD6 

pBD8 

PT127 

RSF1010 
Rmsl49 
pKT209 
RK2 

pSa727 

pJU12 

pJU7 

pJUlO 

pJU16 

pJU13 

YEp24 

YIp5 

YRpl7 



Comments 

Many selectable replicons 
have been characterized. 
Maniatis, T. et al. (1982), 
Molecular Cloning: A 
Laboratory Manual , Cold 
Spring Harbor Laboratory. 

Genetics and Biotechnology 
of Bacilli , Ganesan and 
Hoch, eds., 1984, Academic 
Press. 



Some vectors useful in 
broad host range of gram- 
negative bacteria including 
Xanthomonas and Agrobacterii 



Shuttle plasmids for E. 
coli and C. perfrinqens 
construction ref. Squires, 
C. et al. (1984) Journal 
Bacteriol. 159 :465-471. 
Botstein and Davis in 
Molecular Biology of the 
Yeast Saccharomyces , 
Strathern, Jones, and 
Broach, eds., 1982, Cold 
Spring Harbor Laboratory. 



It is to be understood that additional cloning vectors may now 
exist or will be discovered which have the above- identif ied prop- 
erties and are therefore suitable for use in the present 
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invention. These vectors are also contemplated as being -luhin 
the scope of the disclosed series of cloning vectors into which 
the portable DNA sequences may be introduced, along with any nec- 
essary operational elements, and which altered vector is then in- 
' eluded within the scope of the present invention and would be ca- 
- pable of being used in the recombinant-DNA method set forth more 
/ fully below. 

jj In addition to the above list, an E. coli vector system, as 

"set forth in Example 2, is preferred in one embodiment as a 
'cloning vector. Moreover, several vector plasmids which autono- 
mously replicate in a broad range of Gram Negative bacteria are 
preferred for use as cloning vehicles in hosts of the genera 
Pseudomonas . These are described by Tait, R.C., Close, T.J., 
Lundquist, R.C. , Hagiya, M. , Rodriguez, R.L., and Kado, C.I. in 
Biotechnology, May, 1983, pp. 269-275; Panopoulos, N.J. in 
Genetic Engineering in the Plant Sciences , Praeger Publishers, 
New York, New York, pp. 163-185, (1981); and Sakaguchi, K. in 
Current Topic in Microbiology and Immunology 96:31-45, (1982), 
each of which is specifically incorporated herein by reference. 

One particularly preferred construction employs the plasmid 
RSF1010 and derivatives thereof as described by Bagdasarian, M. , 
Bagdasarian, M.M. , Coleman, S. , and Timmis, K.N. in Plasmids of 
Medical Environmental and Commerci al Importance, Timmis, K.N. and 
Puhler, A. eds., Elsevier/North Holland Biomedical Press, (1979), 
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!j specifically incorporated herein by reference. The advantages of 
Ij RSF10I0 are that it is relatively small, high copy number plasmid 
;| which is readily transformed into and stably maintained in both 
! e. coli and Pseudomonas species. In this system, it is preferred 
: to use the Tac expression system as described for Escherichia , 
!! since it appears that the E. coli trp promoter is readily recog- 
nized by Pseudomonas RNA polymerase as set forth by Sakaguchi, K. 
. in Current Topics in Microbiology and Immunology 96:31-45 (1982) 
and Gray, G.L., McKeovn, K.A., Jones, A.J.S., Seeburg, P.H., and 
Heyneker, H. L. in Biotechnology Feb. 1984, pp. 161-165, both of 
which are specifically incorporated herein by reference, Tran- 
0 scriptional activity may be further maximized by requiring the 

v5 exchange of the promoter with, e.g., an E. coli or P. aeruginosa 

y trp promoter. 

Ifl In a preferred embodiment, P. aeruginosa is transformed with 

vectors directing the synthesis of the metalloproteinase inhib- 
it itor as either an intracellular product or as a product coupled 
^ to leader sequences that will effect its processing and export 
0 from the cell. In this embodiment, these leader sequences are 

preferably selected from the group consisting of beta-lactamase, 
OmpA protein, the naturally occurring human signal peptide, and 
that of carboxyept idase G2 from Pseudomonas . Translation may be 
coupled to translation initiation for any of the E. coli proteins 
as described in Example 2, as veil as to initiation sites for any 
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of the highly expressed proteins of the host to cause in- 
• tracellular expression of the metalloproteinase inhibitor. 

In those cases where restriction minus strains of a host 
Pseudomonas species are not available, transformation efficiency 
with plasmid constructs isolated from E. coli are poor. There- 
fore, passage of the Pseudomonas cloning vector through an r- m+ 
strain of another species prior to transformation of the desired 
host, as set forth in Bagdasarian, M. , et al. f Plasmids of 
Medical, Environmental and Commercial Importance , pp. 411-422, 
Timmis and Puhler eds., Elsevier/North Holland Biomedical Press 
(1979), specifically incorporated herein by reference, is de- 
sired. 

Furthermore, a preferred expression system in hosts of the 
genera Baci llus involves using plasmid pUBHO as the cloning ve- 
hicle. As in other host vector systems, it is possible in 
Bacillus to express the metalloproteinase inhibitors of the pres- 
ent invention as either an intracellular or a secreted protein. 
The present embodiments include both systems. Shuttle vectors 
that replicate in both Bacillus and E. coli are available for 
constructing and testing various genes as described by Dubnau, 
D., Gryczan, T., Contente, S., and Shivakumar, A.G. in Genetic 
Engineering , Vol. 2, Setlov and Hollander eds., Plenum Press, New 
York, New York, pp. 115-131, (1980), specifically incorporated 
herein by reference. For the expression and secretion of 
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metalloproteinase inhibitors from B. subtil is , the signal se- 
; quencer of alpha-amylase is preferably coupled to the coding re- 
i gion for the metalloproteinase inhibitor. For synthesis of 
' intracellular metalloproteinase inhibitor, the portable DNA se- 
quence will be translationally coupled tc the ribosome binding 
site of the alpha-amylase leader sequence. 

Transcription of either of these constructs is preferably 
; directed by the alpha-amylase promoter or a derivative thereof. 
This derivative contains the RNA polymerase recognition sequence 
of the native alpha-amylase promoter but incorporates the lac op- 
erator region as well. Similar hybrid promoters constructed frorr 
the penicillinase gene promoter and the lac operator have been 
shown to function in Bacillus hosts in a regulatable fashion as 
set forth by Yansura, D.G. and Henner in Genetics and Biotechnol- 
oqv of Bacilli , Ganesan, A.T. and Hoch, J. A., eds., Academic 
Press, pp. 249-263, (1984), specifically incorporated by refer- 
ence. The lad gene of lacl^ would also be included to effect 
regulation. 

One preferred construction for expression in Clostridium is 
in plasmid pJUl2 described by Squires, C. H. et al in J. 
Bacteriol. 159:465-471 (1984), specifically incorporated herein 
by reference, transformed into C. perfringens by the method of 
Heefner, D. L. et al. as described in J. Bacteriol. 159:460-464 
(1984), specifically incorporated herein by reference. 
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Transcription is directed by the promoter of the tetracycline re- 
sistance gene. Translation is coupled to the Shine-Dalgarno se- 
^ quences of this same tet r gene in a manner strictly analogous to 
; the procedures outlined above for vectors suitable for use in 
other hosts. 

Maintenance of foreign DNA introduced into yeast can be ef- 
! fected in several ways (Botstein, D. f and Davis, R. W., in The 
j Molecular Biology of the Yeast Saccharomyces , Cold Spring Harbor 
'Laboratory, Strathern f Jones and Broach, eds., pp. 607-636 
(1982). One preferred expression system for use with host organ- 
isms of the genus Saccharomyces harbors the ant icollagenase gene 
on the 2 micron plasmid. The advantages of the 2 micron circle 
include relatively high copy number and stability when introduced 
into cir° strains. These vectors perferably incorporate the rep- 
lication origin and at least one antibiotic resistance marker 
from pBR322 to allow replication and selection in E. coli . In 
addition, the plasmid will preferably have 2 micron sequences and 
the yeast LEU 2 gene to serve the same purposes in LEU2 mutants of 
yeast. 

The regulatable promoter from the yeast GAL1 gene will pref- 
erably be adapted to direct transcription of the portable DNA se- 
quence gene. Translation of the portable DNA sequence in yeast 
will be coupled to the leader sequence that directs the secretion 
of yeast alpha-factor. This will cause formation of a fusion 
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ij protein which will be processed in yeast and result in secretion 

of a ntetalloproteinase inhibitor. Alternatively, a methionyl- 
i! metalloproteinase inhibitor will be translated for inclusion 
within the cell. 

It is anticipated that translation of mRNA coding for zhe 
metalloproteinase inhibitor in yeast will be more efficient with 
the preferred codon usage of yeast than with the sequence presen 
< in pUC8-Fic, as identified in Example 2, which has been tailored 
. to the prokaryotic bias. For this reason, the portion of the 5 r 
: end of the portable DNA sequence beginning at the Tth llll site i 
preferably resynthesized. The new sequence favors the codons 
most frequently used in yeast* This new sequence preferably has 
the following nucleotide sequence: 
HgiAI 

5' GAT CCG TGC ACT TGT GTT CCA CCA CAC 
GC ACG TGA ACA CAA GGT GGT GTG 

CCA CAA ACT GCT TTC TGT AAC TCT GAC C 
GGT GTT TGA CGA AAG ACA TTG AGA CTG GA 3' 
As will be seen from an examination of the individual 
cloning vectors and systems contained on the above list and de- 
scription, various operational elements may be present in each c 
the preferred vectors of the present invention. It is contem- 
plated any additional operational elements which may be required 
may be added to these vectors using methods known to those of 
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J ordinary ckill in the art, particularly in light of the teachings 

| hereiriV 

i 

| In practice, it is possible to construct each of these 

i 

\ vectors in a way that allows them to be easily isolated, assem- 
, j bled, and interchanged. This facilitates assembly of numerous 

f j functional genes from combinations of these elements and the 

i; 

,j coding region of the metalloproteinase inhibitor. Further, many 
.[of these elements will be applicable in more than one host, 
i At least one origin of replication recognized by the contem- 
plated host microorganism, along with at least one selectable 
'marker and at least one promoter sequence capable of initiating 
transcription of the portable DNA sequence are contemplated as 
being included in these vectors. It is additionally contemplated 
that the vectors, in certain preferred embodiments, will contain 
DNA sequences capable of functioning as regulators ("operators"), 
and other DNA sequences capable of coding for regulator proteins* 
* In preferred vectors of this series, the vectors additionally 

contain ribosome binding sites, transcription terminators and 
leader sequences. 

These regulators, in one embodiment, will serve to prevent 
expression of the portable DNA sequence in the presence of cer- 
tain environmental conditions and, in the presence of other envi- 
ronmental conditions, allow transcription and subsequent expres- 
sion of the protein coded for by the portable DNA sequence. In 



law orrices -q^ 

Finnecan. Henderson ~ 
F/^rabow. Garrett 

ft DUNNER, 

\7*t% * STBCtT. N W. 
WA9MINCTON, O. C.2000* 
<202i 293-«*SO 



particular, it preferred that regulatory segments be inserted 
I into the vector such that expression of the portable DNA sequence 
will not occur in the absence of, for example, isopro- 
■ pylthio-P* -d-galactoside. In this situation, the transformed mi- 
croorganisms containing the portable DNA may be grovn to a de- 
sired density prior to initiation of the expression of the 
metalloproteinase inhibitor. In this embodiment, expression of 
the desired protease inhibitor is induced by addition of a sub- 
stance to the microbial environment capable of causing expression 
of the DNA sequence after the desired density has been achieved. 

Additionally, it is preferred that an appropriate secretory 
leader sequence be present, either in the vector or at the 5' end 
of the portable DNA sequence, the leader sequence being in a 
position which allows the leader sequence to be immediately adja- 
cent to the initial portion of the nucleotide sequence capable of 
directing expression of the protease inhibitor without any inter- 
vening transcription or translation termination signals. The 
presence of the leader sequence is desired in part for one or 
more of the following reasons: 1) the presence of the leader se- 
quence may facilitate host processing of the initial product to 
the mature recombinant metalloproteinase inhibitor; 2) the pres- 
ence of the leader sequence may facilitate purification of the 
recombinant metalloproteinase inhibitors, through directing the 
metalloproteinase inhibitor out of the cell cytoplasm; 3) the 



L4W ounces ""30** 

Finnegan. Henderson 
Farabow. Garrett 
8 Dunner 

1773 K STHtCT. M.W. 
WASHINGTON. 0. C.ZOOO« 



presence of the leader seauence may affect the ability of the 
recombinant metalloproteinase inhibitor to fold to its active 
structure through directing the metalloproteinase inhibitor out 
of the cell cytoplasm. 

In particular, the leader sequence may direct cleavage of 
the initial translation product by a leader peptidase to remove 
the leader sequence and leave a polypeptide with the amino acid 
sequence which has the potential of metalloproteinase inhibitory 
activity. In some species of host microorganisms, the presence 
'of the appropriate leader sequence will allow transport of the 

i 

completed protein into the periplasmic space, as in the case of 
E. coli. In the case of certain yeasts and strains of Bacillus 
and Pseudomonas , the appropriate leader sequence will allow 
transport of the protein through the cell membrane and into the 
extracellular medium. In this situation, the protein may be 
purified from extracellular protein. 

Thirdly, in the case of some of the metalloproteinase inhib- 
itors prepared by the present invention, the presence of the 
leader sequence may be necessary to locate the completed protein 
in an environment where it may fold to assume its active struc- 
ture, which structure possesses the appropriate metalloproteinase 
activity. 

Additional operational elements include, but are not limited 
to, ribosome-binding sites and other DNA sequences necessary for 
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!! 

, 1 
I 

I 

I microbial expression of foreign ~- ^teins. The operational ele- 
■ iments as discussed herein can be routinely selected by those of 
• ordinary skill in the art in light of prior literature and the 
teachings contained herein. General examples of these operation- 
al elements are set forth in B. Levin, Genes , Wiley & Sons, New 
: York (1983), which is specifically incorporated herein by refer- 
ence. Various examples of suitable operational elements may be 
I found on the vectors discussed above and may be elucidated 
'through review of the publications discussing the basic charac- 
teristics of the aforementioned vectors. 

In one preferred embodiment of the present invention, an 
additional DNA sequence is located immediately preceding the por- 
table DNA sequence which codes for the metal loproteinase inhib- 
itor. The additional DNA sequence is capable of functioning as a 
translational coupler, i.e., it is a DNA sequence that encodes an 
RNA which serves to position ribosomes immediately adjacent to 
the ribosome binding site of the metalloproteinase inhibitor RNA 
with which it is contiguous. 

Upon synthesis and/or isolation of all necessary and desired 
component parts of the above-discussed cloning vectors, the 
vectors are assembled by methods generally known to those of or- 
dinary skill in the art. Assembly of such vectors is believed to 
be within the duties and tasks performed by those with ordinary 
skill in the art and, as such, is capable of being performed 
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without undue experimentation. For examp]* similar DMA se- 
quences have been ligated into appropriate cloning vectors, as 
set forth in Schoner et al . , Proceedings of the National Academy 
of Sciences U.S.A., 81:5403-5407 (1984), which is specifically 
; incorporated herein by reference. 

"I In construction of the cloning vectors of the present inven- 

tion, it should additionally be noted that multiple copies of the 
: ! portable DNA sequence and its attendant operational elements may 

• be inserted into each vector. In such an embodiment, the host 

it 

■organism would produce greater amounts per vector of the desired 
"metalloproteinase inhibitor. The number of multiple copies of 
the DNA sequence which may be inserted into the vector is limited 
only by the ability of the resultant vector, due to its size, to 
be transferred into and replicated and transcribed in an appro- 
priate host microorganism. 

Additionally, it is preferred that the cloning vector con- 
tain a selectable marker, such as a drug resistance marker or 
other marker which causes expression of a selectable trait by the 
host microorganism. In a particularly preferred embodiment of 
the present invention, the gene for ampicillin resistance is in- 
cluded in vector pUC9-F5/237P10 . 

' Such a drug resistance or other selectable marker is intend- 

ed in part to facilitate in the selection of transf ormants . 
Additionally, the presence of such a selectable marker on the 
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cloning vector may be of use in keeping contaminating microorga- 
| nisms *f rom multiplying in the culture medium* In this embodi- 

Smeat, such a pure culture of the transformed host microorganisms 

i 

j would be obtained by culturing the microorganisms under condi- 
tions which require the induced phenotype for survival. 

I 

• It is noted that, in preferred embodiment, it is also desir- 

sable to reconstruct the 3' end of the coding region to allow 
; assembly with 3 1 non-translated sequences* Included among these 
non-translated sequences are those which stabilize the mRNA or 
enhance its transcription and those that provide strong tran- 
scriptional termination signals which may stabilize the vector as 
they are identified by Gentz, R. , Langner, A., Chang, A.C.Y., 
Cohen, S.H., and Bujard,'H. in Proc* Natl. Acad* S,ci* USA 
713:4936-4940 (1981), specifically incorporated herein by refer- 
ence* 

This invention also relates to a recombinant-DNA method for 
the production of metallproteinase inhibitors* Generally, this 
method includes: 

(a) preparation of a portable DNA sequence capable of 
directing a host microorganism to produce a protein 
having metalloproteinase inhibitor activity; 

(b) cloning the portable DNA sequence into a vector capable 
of being transferred into and replicating in a host mi- 
croorganism, such vector containing operational 
elements for the portable DNA sequence; 
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(c) transferring the vector containing the portable DUA se- 
quence and operational elements into a host microorga- 
nism capable of expressing the metalloproteinase inhib- 
itor protein; 

(d) culturing the host microorganism under conditions 
appropriate for amplification of the vector and expres- 
sion of the inhibitor; and 

(e) in either order: 

(i) harvesting the inhibitor; and 

(ii) causing the inhibitor to assume an active, terti- 
ary structure whereby it possesses metallopro- 
teinase inhibitor activity. 

In this method, the portable DNA sequences are those syn- 
thetic or naturally-occurring polynucleotides described above. 
In a preferred embodiment of the present method, the portable DNA 
sequence has the nucleotide sequence as follows: 



10 20 30 40 50 60 

GTTGTTGCTG TGGCTGATAG CCCCAGCAGG GCCTGCACCT GTGTCCCACC CCACCCACAG 

70 80 90 100 110 120 

ACGGCCTTCT GCAATTCCGA CCTCGTCATC AGGGCCAAGT TCGTGGGGAC ACCAGAAGTC 

130 140 150 160 170 180 

AACCAGACCA CCTTATACCA GCGTTATGAG ATCAAGATGA CCAAGATGTA TAAAGGGTTC 

190 200 210 220 230 240 

CAAGCCTTAG GGGATGCCGC TGACATCCGG TTCGTCTACA CCCCCGCCAT GGAGAGTGTC 

250 260 270 280 290 300 

TGCGGATACT TCCACAGGTC CCACAACCGC AGCGAGGAGT TTCTCATTGC TGGAAAACTG 
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II 310 320 330 340 350 360 

I CAGGATGGAC TCTTGCACAT CACTACCTGC AGTTTCGTGG CTCCCTGGAA CAGCCTGAGC 

j 370 380 390 400 410 420 

| TTAGCTCAGC GCCGGGGCTT CACCAAGACC TACACTGTTG GCTGTGAGGA ATGCACAGTG 

' 430 440 450 460 470 480 

TTTCCCTGTT TATCCATCCC CTGCAAACTG CAGAGTGGCA CTCATTGCTT GTGGACGGAC 

.i 490 500 510 520 530 540 

: CAGCTCCTCC AAGGCTCTGA AAAGGGCTTC CAGTCCCGTC ACCTTGCCTG CCTGCCTCGG 

550 560 570 580 590 600 

GAGCCAGGGC TGTGCACCTG GCAGTCCCTG CGGTCCCAGA TAGCCTGAAT CCTGCCCGGA 

\ 610 620 630 640 650 660 

: GTGGAAGCTG AAGCCTGCAC AGTGTCCACC CTGTTCCCAC TCCCATCTTT CTTCCGGACA 

670 680 -690 700 

jATGAAATAAA GAGTTACCAC CCAGCAAAAA AAAAAAGGAA ttc 

The vectors contemplated as being useful in the present 
method are those described above. In a preferred embodiment, the 
cloning vector pUC9-F5/237P10 is used in the disclosed method. 

The vector thus obtained is then transferred into the appro- 
priate host microorganism. It is believed that any microorganism 
having the ability to take up exogenous DNA and express those 
genes and attendant operational elements may be chosen. It is 
preferred that the host microorganism be an anaerobe, facultative 
anaerobe or aerobe. Particular hosts which may be preferable for 
use in this method include yeasts and bacteria. Specific yeasts 
include those of the genus Saccharomyces , and especially 
Saccharomyces cerevisiae . 

Specific bacteria include those of the genera Bacillus and 
Escherichia and Pseudomonas. Various other preferred hosts are 
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set forth in Table I, supra . In other, alternatively preferred 
embodiments of the present invention, Bacillus subtil is , 

Escherichia coli or Pseudomonas aeruginosa are selected as the 

' l 

: I host microorganisms. 

After a host organism has been chosen, the vector is trans- 

i 

,iferred into the host organism using methods generally known by 
1 those of ordinary skill in the art. Examples of such methods may 

; i 

i : be found in Advanced Bacterial Genetics by R. w. Davis et al. , 
;1 Cold Spring Harbor Press, Cold Spring Harbor, New York, (1980), 
; which is specifically incorporated herein by reference. It is 
preferred, in one embodiment, that the transformation occur at 
^ low temperatures, as temperature regulation is contemplated as a 

means of regulating gene expression through the use of operation- 
W al elements as set forth above. In another embodiment, if 

W osmolar regulators have been inserted into the vector, regulation 

H of the salt concentrations during the transformation would be re- 

U quired to insure appropriate control of the synthetic genes, 

y, If it is contemplated that the recombinant metalloproteinase 

t% inhibitors will ultimately be expressed in yeast, it is preferred 

r ' J ' that the cloning vector first be transferred into Escherichia 

coli , where the vector would be allowed to replicate and from 
which the vector would be obtained and purified after amplifica- 
tion. The vector would then be transferred into the yeast for 
ultimate expression of the metalloproteinase inhibitor. 
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The host microorganisms are cultured under conditions appro- 
! priate r "for the expression of the metalloproteinase inhibitor, 
j These conditions are generally specific for the host organism, 
;i and are readily determined by one of ordinary skill in the art, 
j in light of the published literature regarding the growth condi- 
tions for such organisms, for example Beroev's Manual of 
. Determinative Bacteriology , 8th Ed., Williams & Wilkins Company, 
! Baltimore, Maryland, which is specifically incorporated herein by 
reference. 

Any conditions necessary for the regulation of the expres- 
sion of the- DNA sequence, dependent upon any operational elements 
! inserted into or present in the vector, would be in effect at the 
transformation and culturing stages. In one embodiment, the 
cells are grown to a high density in the presence of appropriate 
regulatory conditions which inhibit the expression of the DNA se- 
quence. When optimal cell density is approached, the environ- 
mental conditions are altered to those appropriate for expression 
of the portable DNA sequence. It is thus contemplated that the 
production of the metalloproteinase inhibitor will occur in a 
time span subsequent to the growth of the host cells to near 
■optimal density, and that the resultant metalloproteinase inhib- 
itor will be harvested at some time after the regulatory condi- 
tions necessary for its expression were induced. 
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In a preferred embodiment of the present invention, the 
recombinant metalloproteinase inhibitor is purified subsequent tc 
{ harvesting and prior to assumption of its active structure. This 
! embodiment is preferred as the inventors believe that recovery of 
I a high yield of re-folded protein is facilitated if the protein 
| is first purified. However, in one preferred, alternate embodi- 

' intent, the metalloproteinase inhibitor may be allowed re-fold to 

il 

ij assume its active structure prior to purification. In yet 

|| 

t ; another preferred, alternate embodiment, the metalloproteinase 
.j inhibitor is caused to assume its re-folded, active state upon 
; | recovery from the culturing medium. 
P ! In certain circumstances, the metalloproteinase inhibitor 

,will assume its proper, active structure upon expression in the 
host microorganism and transport of the protein through the cell 
vail or membrane or into the periplasmic space- This will gener- 
ally occur if DNA coding for an appropriate leader sequence has 
been linked to the DNA coding for the recombinant protein. The 
preferred metalloprot xena^e inhibitors of the present invention 
will assume their mature, active form upon translocation out of 
the inner cell membrane. The structures of numerous signal 
peptides have been published, for example by Marion E.E. Watson 
! in Nuc. Acid Res. 12:515-5164, 1984, specifically incorporated 
therein by reference. It is intended that these leader sequences, 
together with portable DNA, will direct intracellular production 
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;j of a fusion protein vhich will be transported through the cell 

'! membrane and will have the leader sequence portion cleaved uDon 

j 

|| release from the cell. 

; j 

'\ In a preferred embodiment, the signal peptide of the E. col i 

■ OmpA protein is used as a leader sequence and is located in a 

^position contiguous vith the portable DNA sequence coding for the 
: metalloproteinese inhibitor structure. 

j Additionally preferred leader sequences include those of 

i 

beta-lactamase, carboxypept idase G2 and the human signal protein. 

t 

; These and other leader sequences are described. 

! If the metalloproteinase inhibitor does not assume its prop- 

er, active structure, any disulfide bonds vhich have formed 
and/or any noncovalent interactions which have occurred will 
first be disrupted by denaturing and reducing agents, for exam- 
ple, guanidinium chloride and Js-mercaptoethanol , before the 
metalloproteinase inhibitor is allowed to assume its active 
structure following dilution and oxidation of these agents under 
controlled conditions. 

The transcription terminators contemplated herein serve to 
stabilize the vector. In particular, those sequences as de- 
scribed by Gentz et al . , in Proc. Natl. Acad. Sci. USA 78: 
4936-4940 (1981), specifically incorporated herein by reference, 
are contemplated for use in the present invention. 
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It is to be understood that application of the teachings of 
I the present invention to a specific problem or environment will 
be within the capabilities of one having ordinary skill in the 
art in light of the teachings contained herein. Examples of the 
; products of the present invention and representative processes 

i! for their isolation and manufacture appear in the following exam- 

i i 

■ ; pies. 

*' EXAMPLES 

, | 

I EXAMPLE 1 

Preparation of Po1y(A*) RNS from HEF-SA Fibroblasts 

' HEF-SA cells were grown to near confluence in 75 cm 2 

T-flasks, Cells were washed twice in Dulbecco's phosphate buff- 
ered saline solution and harvested by the addition of 2 ml of 10 

i 

mM Tris f pH 7.5 containing 1% w/v SDS (obtained from BDH chemi- 
cals, Ltd., Poole, England), 5 mM EDTA and 20 ug/ml protease K 
(obtained from Boehringer Mannheim Biochemicals , Indianapolis, 
Indiana). Each flask was subsequently washed with an additional 
milliliter of this same solution. 

The pooled aliquots from the cell harvest were made to 70 
ug/ml in protease K and incubated at 40°C for 45 minutes. The 
proteolyzed solution was brought to a NaCl concentration of 150 
mM by the addition of 5 M stock and subsequently extracted with 
an equal volume of phenol : chloroform 1:1. The aqueous phase was 
reextracted with an equal volume of chloroform. Two volumes of 
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Ij ethanol were added to the aqueous phase and incubated overnight 
'{ at -20-°C. The precipitated nucleic acids were recovered by 
jj centrif ugation at 17,500 xg for 10 minutes in a Beckman J2-21 
■'centrifuge, Beckman Instruments, Palo Alto, California, and were 
redissolved in 25 ml of 0.1% w/v SDS. This solution was again 

' i 

'extracted vith an equal volume of chloroform. The aqueous phase 
'| was added to two volumes of cold ethanol and kept at -20°C for 2 
i 1 hours. The precipitate was collected by centrif ugation at 10,000 

! i 
I 

l \xg for 15 minutes and redissolved in 10 ml of 1 mM Tris, 0.5 mM 
EDTA, 0.1% SDS, pH 7.5. RNA was precipitated from this solution 
by the addition of 10 ml of 4 M LiCl, 20 mM NaoAc, pH 5*0 and in- 
!cubated at -20°C for 18 hours. The precipitate vas again recov- 
^ ered by centrif ugation and 4 washed twice with 2 M LiCl before 

^ redissolving in 1 mM Tris, 0.5 mM EDTA, 0.1% SDS, pH 7.5. This 

''4. 

^ solution was stored at -70°C. 

^ Chromatography on Oliqo dT Cellulose 

M' Total cellular RNA prepared as above was ethanol precipi- 

tin 

U tated and redissolved in 0.5 M NaCl. Five ml of RNA at 0.45 

Q mg/ml were applied to a 1 ml column of washed type VII oligo dT 

id, 

cellulose (obtained from PL Biochemicals , Milwaukee, Wisconsin). 
The column was then washed with 10 ml of 0.5 M NaCl and eluted 
with 2.0 ml of sterile H2O. The eluted poly(A + ) fraction of RNA 
was ethanol precipitated and dissolved to give a 1 mg/ml solution 
in 1 mM Tris, 0.1 mM EDTA, pH 8.0. This was stored at -70°C. 
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cDNA Synthesis 



Paly(A + ) RNA was primed with oligo dT (obtained from PL 
Biochemicals, Milwaukee, Wisconsin) to serve as a template for 
cDNA synthesis by AMV reverse transcriptase (obtained from Life 
Sciences, Inc., St* Petersburg, Florida). Following the synthe- 
sis reaction, the RNA was hydrolyzed by the addition of 0.1 vol- 
ume of 3 N NaOH and incubated at 67°C for 10 minutes. The solu- 

Ition was then neutralized and the cDNA purified by gel filtration 

f 

: chromatography on biogel A 1.5 (obtained from BioRad La- 
boratories, Richmond, California) in a 0.7x25 cm column in a 10 
jmM Tris, 5 mM EDTA, and 1% SDS, pH 7.5 solution. Fractions con- 
taining cDNA were pooled and concentrated by ethanol precipita- 
tion. The cDNA was dG tailed and purified by gel filtration 
using the procedure set forth above. Second strand synthesis was 
primed with oligo dC and polymerized in an initial reaction with 
the large (Rlenow) fragment of DNA polymerase (obtained from 
Boehringer Mannheim). Following second strand synthesis, E. col i 
DNA polymerase I (obtained from Boehringer Mannheim) was added 
and incubation continued to form blunt ends. The double stranded 
cDNA was again purified by chromatography. Eco RI restriction 
sites within the cDNAs were modified by the action of Eco RI 

i 

methylase, obtained from New England Biolabs, Beverly, Mas- 
sachusetts. The cDNA was again purified and ligated to synthetic 
Eco RI linkers. Finally, the ends were then trimmed with the 
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endonuclease and the cDNA purified by gel filtration. This DNA 
was ligated into a unique EcoRI site in lambda gtlO DNA packaged 
in vitro and used to infect E. coli strain hflA according to the 
method set forth by Huynh, T.V. , Young, R.A., and Davis, R.W., in 
;': DNA Cloning Techniques , A Practical Approach (ed. Glover, D.M.) 
(IRL Press Oxford), in press , specifically incorporated herein by 
reference. Approximately 25,000 recombinants were amplified in 
this manner. 
Screening 

j Recombinant-phage-containing sequences of interest were se- 
lected by their preferential hybridization to synthetic oligo- 
nucleotides encoding portions of the primary structure of the de- 
■ sired metalloproteinase inhibitor, hereinafter referred to as 
'FIBAC. These portions of the protein sequence correspond in part 
! to those set forth in the published literature by Stricklin, G.P. 
! and Welgus, H.G., J. .Biol. Chem. 258: 12252-12258 (1983), specif- 
ically incorporated herein by reference. Recombinant phage were 
used to infect E. coli strain hflA and plated at a density of 
approximately 2xl0 3 pfu/150 mm petri dish. Phage were blotted 
onto nitrocellulose f i Iters .( BA85 , Schleicher & Schuell Inc., 
jlteene, New Hampshire), and DNA was denatured and fixed essential- 
ly as described by Benton and Davis in Science 196:180-182 (1979) 
■I spec if ically incorporated herein by reference. 
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Using that procedure, the filters vere treated sequentially 
ij for 10-15 minutes each in 0*5 M NaCl, then 1 0 M Tris, 1.5 M NaCl 
SpH 8.0, and finally submerged in 2x SSPE. (2x SSPE is 0.36 M 
NaCl f 20 mM NaH2P04, 2 mM EDTA pH 7.4). Filters were blotted dry 
iand baked 75°-80° for 3-4 hours. Duplicate filters vere made of 

i 

I each plate. Filters were prehybridized for 1-3 hours at 37° in 
5x SSPE containing O.lx SET, 0.15% NaPPi, and lx Denhardts solu- 
tions. Filters vere then hybridized for 72 hours at 37° in this 

f same solution containing 5xl0 5 cpm/ml of 5' end-labeled* 51-mer 
oligonucleotide specific activity approximately 10 6 cpm/pmole). 
Folloving hybridization, filters vere vashed six times in 5x SSPE 
containing O.lx SET and 0.05% sodium pyrophosphate at 37°, then 
three times in 2x SSPE at 21°. These vere then blotted dry and 
autoradiographed on Kodak XAR-5 film at -70° with a Kodak 
lightening-plus intensifying screen. Signals clearly visible 
from duplicate filters vere used to pick phage for plaque purifi- 
cation. Filter preparations and hybridization procedures for 
plaque purification steps were the same as above. The vashing 
procedure was simplified to 6 changes of 2x SSPE at 37°, Six 
isolates purified by repetitive plating vere then arranged on a 
single lavn of E. coli strain C600 for testing vith subsequent 
probes ♦ 

Preferential hybridization of the 17-mer to each of the iso- 
lates (as opposed to control plaques) vas observed under a 
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;! condition identical to that used for plaque purif ica* icn. Probe 
*! c was used in a similar test, except that the SSPE concentration 

"r 

'during hybridization was reduced to 4x. Again, each of the iso- 
lates demonstrated stronger hybridization to the probe than did 
control plaques, 

■ Phage Purification and cDNA Characterization 

Quantities of each of the six isolated phage were made by 
/the plate stock technique and purified by serial CsCl block gra- 
' dient centrif ugat ion. DNA vas extracted from these by dialysis 
against 50% formamide as described by Davis, R.W., Botstein, D. f 
Roth, J.R., in A Manual for Genetic Engineering: Advanced Bacte- 
rial Genetics , 1980, Cold Spring Harbor Laboratory, specifically 
incorporated herein by reference. DNA from each of the isolates 
was digested with Eco RI and the products were analyzed by agarose 
gel electrophoresis. The insert from one of the larger clones, 
lambda FIBAC 5, was found to lack internal sites for Sal I , 
Hindi 1 1 , BamHI , and EcoRI. The cDNA insert was released from 
lambda FIBAC 5 DNA and the lambda arms digested by co-digesting 
with these four enzymes. The fragments were then ethanol- 
precipitated and ligated into the Eco RI site of plasmid pUC9 
without further purification. These plasmids were then used to 
transform E. coli strain JM83. Transf ormants were selected on 
ampicillin containing plates. Plasmids from several trans- 
f ormants were purified and characterized on the basis of the 
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EcoRI digestion products* One was selected which had an insert 
co-migrating with the insert from lambda FIBAC 5 on agarose gel 
electrophoresis. This plasmid has been named pUC9-F5/237P10 . 
Mapping and Subcloninq 

The insert in pUC9-F5/237P10 was mapped with respect to in- 

jj 

Internal Pst I sites. Double digests with Eco RI and Pst demon- 
strated three internal Pst I recognition sites. The entire insert 
ijand the component pieces were subcloned into M13 bacteriophage 

! j 

! :mpl9 and mpl8, respectively. Sequencing of the pieces was per- 
formed by the dideoxynucleot ide method described by Sanger et aL 
; in Sanger, F. , Nicklen, S., and Coulson, A.R., Proc. Natl. Acad. 
Sci. USA 74-5463-5467 (1977), specifically incorporated herein by 
reference. 

The sequence of the DNA insert from pUC9-F5/237P10 showed an 
open reading frame which encodes the primary structure of a ma- 
ture fibroblast collagenase inhibitor biologically equivalent to 
that isolable from human skin fibroblasts. The salient features 
of the sequence are: 

(1) The insert is flanked by EcoRI restriction 
sites and by G/C and A/T homopolymer ic tracts 
consistent with the cloning methodology; 

(2) The coding strand is presented in the 5 ! to 
i 3* convention with poly C at the 5' end and 

poly A at the 3 ? end, again consistent with 
the techniques employed; 

(3) If the first G in the sequence GTTGTTG imme- 
diately adjacent to the 3' end of the poly C 
tract is considered as nucleotide 1, then an 
open reading frame is presented which encodes 
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the primary structure of the mature human 
fibroblast collagenase inhibitor beginning at 
nucleotide 34 and continuing through 
nucleotide 585; 

(4) The termination codon TGA at nucleotides 586 
through 588 defines the carboxy terminus of 
the translation product which is the same as 
that of the mature protein; 

(5) Nucleotides 1 through 33 define an amino acid 
sequence which is not found in the primary 
structure of the processed protein, but which 
is probably a portion of a leader peptide 
characteristic of secreted proteins; 

(6) The three internal PstI sites have as their 
first base nucleotides 298, 327, and 448; 

(7) There is a single recognition sequence for 
the restriction enzyme Tthllll beginning at 
nucleotide 78; and 

(8) There is a single recognition sequence for 

■ the restriction endonuclease Ncol beginning 
at nucleotide 227, 

The sequence of nucleotides 1 through 703 and restriction 



analysis are shown. 
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i! 

! # 
|j ACC 1 .(GTVWAC) 1 

j ALU 1 (AGCT) 4 

■ 

I 

AVA 1 (CQCGPG) 1 

! AVA 2 ( GGRCC ) 3 



BBV 1 (GCTGC) 1 



BST Nl { CCRGG ) 3 



* DDE 1 (CTNAG) 4 

jU: 
Jrfr 
W| 

ECO Rl (GAATTC) 1 



FNU4H 1 (GCNGC) 2 
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SITES FRAGMENTS FRAGMENTS ENDS 



214 


495 


(69.8) 


214 


709 




214 


(30.2) 


1 


214 


358 


358 


(50.5) 


1 


358 


363 


124 


(17.5) 


482 


606 


482 


119 


(16.8) 


363 


482 


606 


103 


(14.5) 


606 


709 




5 


( 0.7) 


358 


363 


536 


536 


(75.6) 


1 


536 




173 


(24.4) 


536 


709 


257 


257 


(36.2) 


1 


257 


477 


220 


(31.0) 


257 


477 


572 


137 


(19.3) 


572 


709 




95 


(13.4) 


477 


572 


269 


440 


(62.1) 


269 


709 




269 


(37.9) 


1 


269 


344 


344 


(48.5) 


1 


344 


544 


200 


(28.2) 


344 


544 


557 


152 


(21.4) 


557 


709 




13 


( 1.8) 


544 


557 


186 


344 


(48.5) 


365 


709 


355 


186 


(26.2) 


1 


186 


360 


169 


(23.8) 


186 


355 


365 


5 


( 0.7) 


360 


365 


5 


( 0.7) 


355 


360 


698 


698 


(98.4) 


1 


698 




11 


( 1.6) 


698 


709 


196 


440 


(62.1) 


269 


709 


269 


196 


(27.6) 


1 


196 


73 


(10.3) 


196 


269 
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FOK 1-4GGATG) 



HAE 2 (PGCGCQ) 



HAE 3 (GGCC) 



HGI Al (GRGCRC) 



HHA 1 (GCGC) 



HINC 2 ( GTQPAC ) 



HINF 1 (GANTC) 



HP A 2 (CCGG) 



HPH 1 (GGTGA) 



# 
4 



SITES 



FRAGMENTS 



FRAGMENTS ENDS 



192 


274 


(38.6) 


435 


709 


204 


192 


(27.1) 


1 


192 


303 


132 


(18.6) 


303 


435 


435 


99 


(14.0) 


204 


303 




12 


( 1.7) 


192 


204 


368 


368 


(51.9) 


1 


368 




341 


(48.1) 


368 


709 


30 


616 


(86.9) 


93 


709 


63 


33 


( 4.7) 


30 


63 


93 


30 


( 4.2) 


63 


93 




30 


( 4.2) 


1 


30 


552 


552 


(77.9) 


1 


552 




157 


(22.1) 


552 


709 


369 


369 


(52.0) 


1 


369 




340 


(48.0) 


369 


709 


118 


591 


(83.4) 


118 


709 




118 


(16.6) 


1 


118 


308 


308 


(43.4) 


1 


308 


587 


279 


(39.4) 


308 


587 




122 


(17.2) 


587 


709 


207 


224 


(31.6) 


372 


596 


372- 


207 


(29.2) 


1 


207 


596 


165 


(23.3) 


207 


372 


654 


58 


( 8.2) 


596 


654 




55 


( 7.8) 


654 


709 


380 


380 


(53.6) 


1 


380 


519 


190 


(26.8) 


519 


709 




139 


(19.6) 


380 


519 
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I # SITES FRAGMENTS FRAGMENTS ENDS 

s'lMBO 2 (GAAGA) 1 



MNL 1 (CCTC) 



650 



81 
274 
406 
486 
535 



650 


(91.7) 


1 


650 


59 


( 8.3) 


650 


70S 


193 


(27.2) 


81 


274 


174 


(24.5) 


535 


709 


132 


(18.6) 


274 


406 


81 


(11.4) 


1 


81 


80 


(11.3) 


406 


a a c 
486 


49 


( 6.9) 


486 


535 


524 


(73.9) 


185 


709 


185 


(26.1) 


1 


185 



, ; MST 2 (CCTNAGG) 1 
i 185 



NCI 1 (CCSGG) 2 ^ 2 (52<s) x 3?2 

595 223 (31.5) 372 595 

114 (16.1) 595 709 

NCO 1 ( CCATGG ) 1 ^ ^ { ^ ^ ^ ?Q9 

227 (32.0) 1 227 

NSP B2 (CVGCWG) 1 ^ ^ (?2>2) ig? 7Q9 

197 (27.8) 1 197 



PST 1 ( CTGCAG ) 3 ^ ^ { ^ Q) ± 29g 

327 
448 



261 (36.8) 448 709 

121 (17.1) 327 448 



29 ( 4.1) 298 327 

SAU 1 (CCTNAGG) 1 ^ (73>g) 18g 709 

185 (26.1) 1 185 

SAU 3A (GATC) 1 ^ (?8>8) 15Q 709 

150 (21.2) 1 150 
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I 



!'SAU96.i (GGNCC) 



# 

5 



SITES 



29 
92 
257 
477 
572 



FRAGMENTS 



FRAGMENTS ENDS 



o *5 n 


\ J 1 . u / 


257 


477 


165 


(23.3) 


92 


257 


137 


(19.3) 


572 


709 


95 


(13.4) 


477 


572 


63 


( 8.9) 


29 


92 


29 


( 4.1) 


1 


29 



SCR Fl (CCNGG) 



SFA Ml ( GATGC ) 



TTH111 1 
(GACNNNGTC) 



344 
372 
544 
557 
595 



193 



79 



*5 A A 

344 


f A Q CL\ 

I 4o • 3 ) 


\ 


344 


172 


(24.3) 


372 


544 


114 


(16.1) 


595 


709 


38 


( 5.4) 


557 


595 


28 


( 3.9) 


344 


372 


13 


( 1.8) 


544 


557 


516 


(72.8) 


193 


709 


193 


(27.2) 


1 


193 


630 


(88.9) 


79 


709 


79 


(11.1) 


1 


79 
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The following do not appear: 



10 





2 


AFL 


2 


AFL 3 


AHA 3 


APA 


1 


ASU 


2 


AVA 3 


AVR 2 


BAL 


1 


BAM 


HI 


BCL 1 


BGL 1 


BGL 


2 


BIN 


1 


BSSH 1 


BST E2 


CFR 


1 


CLA 


1 


ECO R5 


FNUD 2 


GDI 


2 


HAE 


1 


HGA 1 


HGI CI 


HGI 


Dl 


HGI 


32 


HIND 3 


HP A 1 


KPN 


1 


MLU 


1 


MST 1 


NAE 1 


NAR 


1 


NDE 

i~ 


1 


NRU 1 


NSP CI 


PVU 


1 


PVU 


2 


RRU 1 


RSA 1 


SAC 


1 


SAC 


2 


SAL 1 


SMA 1 


SNA 


1 


SPH 


1 


STU 1 


TAQ 1 


XBA 


1 


XHO 


1 


XHO 2 


XMA 3 


XMN 


JL 














20 




30 


40 



50 



60 



GTTGTTGCTG TGGCTGATAG CCCCAGCAGG GCCTGCACCT GTGTCCCACC CCACCCACAG 

SH 
AA 
UE 

i 13 

70 80 ?0 100 110 120 

ACGGCCTTCT GCAATTCCGA CCTCGTCATC AGGGCCAAGT TCGTGGGGAC ACCAGAAGTC 

H T M SH H 

A T N AA I 

E H L UE N 

3 1 1 13 2 



130 140 150 160 170 180 

AACCAGACCA CCTTATACCA GCGTTATGAG ATCAAGATGA CCAAGATGTA TAAAGGGTTC 

S 
A 
U 
A 



190 200 210 220 230 240 

CAAGCCTTAG GGGATGCCGC TGACATCCGG TTCGTCTACA CCCCCGCCAT GGAGAGTGTC 



SD 
AD 
UE 
11 



FS FN 

OF NS 

KA UP 

11 12 



F 
0 
K 
1 



H 
P 
A 
2 



A 

C 
C 
1 



N 
C 

O 

1 
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250 260 270 280 290 30C 

TGCGGATACT TCCACAGGTC CCACAACCGC AGCGAGGAGT TTCTCATTGC TGGAAAACTG 

ABM ? 
V B N S 

A V L T 

2 11 1 







310 




J w w 


340 




CAGGATGGAC 






AGTTTCGTGG 




F 


H 




P 






0 


I 




s 






• K 


N 




1 






1 


1 




T 








370 






*1 U w 




1 TTAGCTCAGC 


GCCGGGGCTT 


CACCAAGACC 


TACACTGTTG 




A D 


HH 


N H 








L D 


AH 


C ? 








U E 


EA 


I H 








, 1 1 


21 


1 1 










430 


A A ft 

44U 


*± D U 


*t o u 




TTTCCCTGTT 


TATCCATCCC 


CTGCAAACTG 


CAGAGTGGCA 








r 


p 










0 


S 










K 
1 


T 

1 








490 


500 


510 


520 




CAGCTCCTCC 


AAGGCTCTGA 


AAAGGGCTTC 


CAGTCCCGTC 




A M 








H 




L N 








P 


?4: 


U L 

1 1 








H 
1 



350 



B 
S 
T 
1 



410 



360 
ICTGAGC 
DAD 
D L D 
E U E 
111 

420 



M 
N 
L 
1 



470 



480 



A 

V 
A 
2 



530 



540 



MA 
NV 
LA 
11 



590 



600 



550 560 570 580 

GAGCCAGGGC TGTGCACCTG GCAGTCCCTG CGGTCCCAGA TAGCCTGAAT CCTGCCCGGA 

B A H NH 

S V I CP 

T A M IA 

1 2 1 12 



B 
S 
T 
1 



H 
G 
I 
1 



650 



660 



GTGGAAGCTG AAGCCTGCAC AGTGTCCACC CTGTTCCCAC TCCCATCTTT CTTCCGGACA 
% M H 

L B P 

U 0 A 

1 2 2 
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670 680 690 700 

ATGAAATAAA GAGTTACCAC CCAGCaaaAA AAAAAAGGAA TTC 

E 
C 

0 

1 



; | EXAMPLE 2-EXPRESSION OF COLLAGENESE INHIBITOR IN E. COLI 
[\ In this Example, a preferred method of coupling a preferred 

portable DNA sequence to* the 5 f end of the cloned cDNA is set 
forth. This involves making a nucleolytic cleavage at a speci- 
fied point within the coding sequence and reconstructing the de- 
■ ! 

•'sired portion of the coding sequence by means of synthetic 
'oligonucleotides in a manner that allows its excision and recom- 
bination (i.e., by incorporating useful restriction sites). 

Trimming the 5* end of the coding region will be 
accomplished by synthesizing both strands of the DNA extending 
from the Tthllll site in the 5' direction and ending in a BamHI 
overhang. This synthetic oligonucleotide, referred to as FIBAC 
A, has the following features: 

(1) Codon selection has been biased toward those most fre 
quently found in the genes of highly expressed bacteri 
al proteins; 

(2) A methionine codon from which to initiate translation 
has been provided immediately upstream from the 
cysteine which begins the coding region of human pro- 
cessed FIBAC; 

(3) The spacing of the Bam HI site to the methionine codon 
is such that when cloned into pUC8 , the coding region 
of FIBAC will be in-frame with the 5' end of the beta- 
galactos idase gene; 
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(4) An in-frame stop codon and Shine Dalgarno sequence are 
also presented. Translaticr. of this frame for the 
amino terminal portion of the beta-galactosidase is 
terminated at the TAA codon, and translation of FI3AC 
should be initiated at the following ATG; 

(5) Codons have been selected to create a Hqi AI site begin 
ning with the G in the FIBAC initiation codon; and 

(6) There is a Pvu l site separated by one base from the 3' 
end of the Bam HI sequence. 

The structure of FIBAC A is 

GA TCC GCG ATC GGA GTG TAA GAA ATG TGC ACT 
G CGC TAG CCT CAC ATT CTT TAC ACG TGA 

TGC GTT CCG CCG CAT CCG CAG ACT GCT TTC 
ACG CAA GGC GGC GTA GGC GTC TGA CGA AAG 

TGC AAC TCT GAC C 
ACG TTG AGA CTG GA 

FIBAC A is synthesized using the ABI DNA synthesizer (Foste 

City, California) as a series of four component oligonucleotides 

Component oligonucleotide FA1 is: 

GATCC GCGAT CGGAG TGTAA GAAAT GTGCA CTTGC 

Component oligonucleotide FA2 is: 

GGAACG CAAGT GCACA TTTCT TACAC TCCGA TCGCG 

Component oligonucleotide FA3 is: 

GTTC CGCCG CATCC GCAGA CTGCT TTCTG CAACT CTGAC C 

Component oligonucleotide FA4 is: 

AGGTC AGAGT TGCAG AAAGC AGTCT GCGGA TGCGG C 

The remainder of the coding portion of the FIBAC gene is 
isolated as the 3 f Tth llll to Eco RI fragment generated by a dou- 
ble digest of pUC9-F5/237P10 with these enzymes. 

A synthetic linker is made to couple the 3* end of the 
Tthllll to EcoRI fragment to a Sail site. These oligonucleotide 



-56- 



' will be designed to recreate the Sai l site and destroy the Zzo Rl 

i. site, The linker is comprised of the oligonucleotides linker Al 

;i 

l| and linker A2 . 

j j 

: | Linker Al is: AATTGGCAG 

Linker A2 is: TCGACTGCC 

These oligonucleotides and oligonucleotides FA1-FA4 are 
kinased separately and annealed in equal molar ratios vith the 

■Tthllll to ScoRI 3* end of the cDNA and BamHI/Sall cut mpl9RF 

! 

' DNA« The ligated DNA is used to transfect JM105. Plaques are 
picked by their color in the presence of IPTG and X-gal and by 
hybridization to oligonucleotide FA2. Several positive plaques 
are to be sequenced. Those containing the designed sequence are 
subcloned into BamH I/ Sal l digested pUC8, Translation of the 
FIBAC gene in this construct is coupled to translation initiated 
for beta-galactosidase. This expression vector is referred to a 
pUC8-Fic. 

Coupling translation of FI3AC to translation initiated for 
other highly expressed proteins is similarly arranged. For exan 
pie, a portion of the OmpA gene which contains the Shine-Dalgarr 
and initiator methionine sequences has been synthesized. This 
sequence encodes the entire signal peptide of OmpA protein and 
had convenient restriction sites, including those for EcoRI, 
Eco RV, Pvu l , and Stul. 
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■ The sequence of the sense strand is: 

jl 

'j 10 20 30 40 5C 6C 

Hgaattcgata tctcgttgga gatattcatg acgtattttg gatgataacg aggcgcaaa^ 

I E T E F M H 

!IC AC 0 N H 

! 0 Q 0 K LA 

■■: 1 1 5 1 11 

70 80 90 100 110 

, . AATGAAAAAG ACAGCTATCG CGATCGCAGT GGCACTGGCT GGTTTCGCTA CCGTA 
A NF PS 
L RN VA 
U UU UU 
1 12 1A 

120 130 

GCGCA GGCCTCTGGT AAAAGCTT 
H S H M HA 
H T A N IL 
A U E L NU 
113 1 31 

This sequence is hereinafter referred to as OmpA leader. 
Coupling the translation of FIBAC to OmpA is accomplished by cut- 
ting the pUC8-Fic with Pvul and Sai l and isolating the coding re- 
gion. This, together with the EcoRI to Pvu l fragment isolated 
from OmpA leader, will be cloned into EcoRI /Sal I -cut pUC8. As ir 
the prior example, transcription is driven by the lac promoter 
and regulated by the lac I gene product at the lac operator. 
This FIBAC expression vector is referred to as pUC8-F/OmpAic . 

To effect the translocation of FIBAC out of the inner cell 
membrane, an appropriate leader sequence is added to the amino 
terminus of FIBAC. The protein thus produced vill be translo- 
cated and processed to yield the mature form. 
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:j To effect such a translocation, a FIBAC gene encoding the 

ii 

' signal- peptide of the E. coli OmpA protein continuous vith the 
!j structural region of FIBAC is created. This particular FI3AC 
:!gene necessitates having in frame stop codons at the 5 1 end of 

the FIBAC coding region changed. To accomplish this, the portion 
-of the 5* coding region from pUCS-Fic that extends from the He i A I 

site to the Ncol site is isolated. Upstream sequences are 
: resynthesized as a linker having cohesive ends from BamHI and 

Hgi AI and containing an internal Stu I site* This is synthesized 

as two oligonucleotides, linker Bl and linker B2. 
Linker Bl is: GATCCCAGGCCTGCA 
Linker B2 is: GGCCTGG 

Linkers Bl and B2 are kinased separately and annealed in 
equal molar ratios vith the KgjAI to Ncol fragment described 
above and Bam HI / Nco l cut pUC8-Fic* The resulting construct has 
the coding sequence of FIBAC in frame vith the translation of th 
amino terminus of beta-galactos idase. Translation of this se- 
quence forms a fusion protein vith FIBAC. This plasmid is 
referred to as pUCB-Ff. 

Attaching the OmpA leader sequences to the coding region of 
FIBAC is accomplished by ligating EcoRI/StuI cut pUC8-Ff vith ar 
excess of the purified EcoRI to Stu I fragment of OmpA leader. 
Folloving transformation, plasmids from several colonies vill be 
characterized by hybridization. Those that have incorporated t 
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: 'ompA leader fragment are characterized further to verify the 

; i 

'structure. This plasmid, pUC8-F OmpAl, will direct the synthesis 

. ! 
t 

'jof a fusion protein beginning in the signal peptide of the 

ii 

ilcoli OmpA protein and ending in human FIBAC. The signals present 
in the OmpA portion of the protein effect the protein's export 
'from the cytoplasm and appropriate cleavage from the primary 
structure of FIBAC. 

, if the efficiency of expression were to be compromised by 

'the sequence of the leader peptide or its combination with FIBAC 
either at the protein or at the nucleic acid level, the gene 
could be altered to encode any of several known E. coli leader 
sequences . 

Transcription of all of the genes discussed is effected by 
the lac promoter. As in the case of initiation sites for trans- 
lation, the promoter and operator region of the gene may be 
interchanged. FIBAC may also be expressed from vectors incor- 
porating the lambda P L promoter and operator (0 L ) , and the hybric 
promoter operator, Tac as described in Amann, E. , Brosius, J., 
and Ptashne, M. Gene 25:167-178 (1983), specifically incorporatec 
herein by reference. Excision of those portions of the gene 
including ribosome binding site structural region and 3' non- 
translated sequences and insertion in alternate vectors contain- 
ing the P L or Tac promoter makes use of the unique restriction 
sites that flank these structures in pUC8-F/0mpAic and 
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1 pUC8-F/0mpAl. Insertion of the EcoRI to Sail fragment from ei- 

i 

! ther into similarly digested plasmid pDP8 effects transcription 

■ of these genes directed by the lambda promoter. Transcrio- 

I 

i tional regulation would be temperature sensitive by merit of the 
cl857 mutation harbored on this same plasmid. 

Putting similar gene fragments into the transcription unit 
, of the Tac promoter will be accomplished by first isolating the 
. Eco RV to Sai l fragment. This, together with the synthetic Tac 
.promoter sequence which is flanked by BamH I and PvuII sites and 
which contains the lac operator will be inserted into the Bam HI 
to Sai l sites of pBR322 or preferably derivatives. The deriva- 
tives in this case refer to constructs containing either the lad 
gene or the I 3 gene. 

Expression of FIBAC in host microorganisms other than 
Escherichia is considered. Yeast and bacteria of the genera 
Bacillus , Pseudomonas , and Clostridium may each offer particular 
advantages. The processes outlined above could easily be adaptec 
to others. 

In general, expression vectors for any microorganism will 
embody features analogous to those which we have incorporated in 
the above mentioned vectors of E. coli . In some cases, it will 
be possible to simply move the specific gene constructs discussec 
above directly into a vector compatible with the new host. In 
others, it may be necessary or desirable to alter certain 
operational or structural elements of the gene. 
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ij EXAMPLE 3 

;i 

;j The human collagenase inhibitor may be readily purified 

i| 

after expression in a variety of microbes. In each case, the 

H • - 

lispectrum of contaminant proteins will differ. Thus, appropriate 
purification steps will be selected from a variety of steps 
already known to give a good separation of the human collagenase 
inhibitor from other proteins and from other procedures which are 
jlikely to work. 

1 If the inhibitor is not secreted from the microbes, it may 

form inclusion bodies inside the recombinant microbes. These 
bodies are separated from other proteins by differential 
centrifugation after disruption of the cells with a French Press. 
The insoluble inclusion bodies are solubilized in 6 M guanidine 
hydrochloride or 8 M urea, and the inhibitor protein is more com- 
pletely solubilized by reaction of its cysteines with sodium 
sulfite. At any time subsequent to this step, the cysteines are 
converted back to their reduced form with dithiothreitol . Once 
the inhibitor protein is solubilized from inclusion bodies, 
immunoaf f inity chromatography using antibodies raised against the 
unfolded inhibitor are used .for purification before refolding. 

The inhibitor can be refolded according to the protocol men- 
tioned in Example 6, infra . After refolding of the inhibitor, or 
if the inhibitor is secreted from the microbes, purification from 
other proteins is accomplished by a variety of methods. Initial 
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I 
I 
j 

1 1 

listens include ultrafiltration through a 50 K dalton cutoff mem- 

!J 

l)brane or ammonium sulfate fractionation. Other useful methods 

jl 

jinclude, but are not limited to, ion-exchange chromatography, gel 

\\ 

.filtration, hepar in-sepharose chromatography, reversed-phase 

■chromatography, or zinc-chelate chromatography. All of these 

i 

; steps have been successfully used in purification protocols. 
Additional high resolution steps include hydrophobic interaction 
chromatography or immunoaf f inity chromatography. After purifica- 
tion, the metalloproteinase inhibitor is preferably at least 
90-95% pure, 

EXAMPLE 4 

O * Purification of Human Collagenase Inhibitor from Human 

Amniotic Fluid 

tt Human amniotic fluid obtained from discarded amniocentesis 

Ul samples was pooled and 6 liters were subjected to ultrafiltration 

sjj through a 100 kD MW cutoff filter, obtained from Millipore Corpo- 

y, ration, in a Millipore Pellicon Cassette System. The eluate was 

y : concentrated through a 10 kD cutoff filter, obtained from 

% Millipore Corporation, then through an Amicon PM-10 membrane. 

Aliquots (10 ml) of concentrated amniotic fluid were eluted 
through a 2.5 x 100 cm column of Ultrogel AcA54 , obtained from 
LKB Corporation, which was equilibrated with pH 7.6, 0.05 M 
hepes, 1 M sodium chloride, 0,01 M calcium chloride, and 0.02% 
sodium azide (all chemicals were obtained from Sigma Chemical 
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Company). Fractions containing the inhibitor were collected and 
'pooled, dialyzed against pH 7.5, 0.025 M Hepes buffer containing 
0.01 M calcium chloride and 0.02% sodium azide, and loaded onto a 
:! 1.5 x 28 cm heparin-sepharose CL-63 (obtained from Pharmacia, 
Inc.) column equilibrated with the same buffer. This column was 
rinsed with 1 liter of the above buffer and eluted with a linear 
gradient of 0-0.3 M sodium chloride. The fractions from the 
largest peak of inhibitor activity, eluting at about 0.1-0.15 M 
sodium chloride, were pooled, concentrated to 1 ml, and loaded 
'onto a Synchropak rp-8 reverse phase HPLC column equilibrated 
with 0.05% trif luoroacetic acid (Aldrich Chemical Company). The 
column was eluted with a linear gradient of 0-40% acetonitrile 
(J. T. Baker Chemical Company) at 1/2% per minute. All fractions 
were immediately dried in a Savant speed-vac concentrator to re- 
move acetonitrile, and redissolved in pH 7.5, 0.1 M Hepes before 
assay. The inhibitor eluted between 32-38% acetonitrile. Frac- 
tions containing the inhibitor were pooled, and 100 ul aliquots 
were eluted over a Bio-rad biosil-TSK 250 HPLC gel filtration 
column. The pooled peaks of inhibitor activity contained 0.1 mg 
of inhibitor, which was over 95% pure as judged by SDS- poly- 
acrylamide gel electrophoresis. 
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■! EXAMPLE 5 

1 

ii Purification of Human Fibroblast Collagenase Inhibitor from 
i 1 . Human Embryonic Skin Fibroblast Serum-Free Medium 

Human embryonic skin fibroblasts were grown in serum-free 
tissue culture medium. Ten liters of this medium were collected, 
dialyzed against pH 7.5, 0.02 M hepes buffer containing 0.02% so- 
'dium azide and 0.01 M calcium chloride, and applied to a 2.8 x 48 
cm column of hepar in-sepharose CL-6B (Pharmacia, Inc.) 
equilibrated with the same buffer. The column was rinsed with 2 
liters of this buffer and was then eluted with linear gradient of 
0-0.3 M sodium chloride contained in this buffer. The fractions 
obtained were tested for the presence of inhibitor by their abil- 
ity to inhibit human fibroblast collagenase. The fractions cor- 
responding to the peak of activity were those obtained near 0.15 
M sodium chloride. These fractions were concentrated to about 5 
ml by ultrafiltration through an Amicon YM10 filter and the con- 
centrate was applied in four separate runs to a 250 x 4.1 mm 
Synchropak rp-8 reverse phase HPLC column, equilibrated with 1% 
trif luoroacetic acid. The column was eluted with a 0-60% linear 
gradient of acetonitrile in 0.1% trif luoroacetic acid. The gra- 
dient was run at 1/2% acetontrile per minute. The inhibitor 
eluted in two sharp peaks between 26-29% acetonitrile. All frac- 
tions were immediately dried in a Savant speed-vac concentrator, 
redissolved in pH 7.5, 0.1 M Hepes, and assayed. At least 1.2 mg 
of collagenase inhibitor was recovered, which was 90-95% pure. 
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!■ This material gives a single band when run on a 17.5% reducing 
''' SDS gel. After carboxymethylation of the cysteines and elution 

through the same rp-8 column under identical conditions, the in- 
' hibitor is suitably homogenous for protein sequencing. 

EXAMPLE 6 

It is contemplated that the human collagenase inhibitor can 
be readily refolded into its native structure from its denatured 
'state after expression of its gene in a microbe and separation of 
the collagenase inhibitor from most of the other proteins pro- 
duced by the microbe. By analogy to the conditions necessary for 
the refolding of other disulf ide-contanning proteins as set forth 
by Freedman, R. B. and Hillson, D. A., in "Formation of Disulfide 
Bonds," In: The Enzymoloov of Post-Trans lational Modification of 
Proteins , Vol. 1, R. B. Freedman and H. C. Hawkins, eds., pp. 
158-207 (1980), specifically incorporated herein by reference, 
refolding of the human collagenase inhibitor should occur in so- 
lutions with a pH of 8.0 or greater. At this pH, the cysteines 
of the protein are partially ionized, and this condition is nec- 
essary for the attainment of native disulfide bond pairings. The 
inhibitor concentration should be relatively low, less than 0.1 
mg/ml, to minimize the formation of intermolecular disulfide- 
linked aggregates which will interfere with the refolding pro- 
cess . 
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! Since the stability of the refolded (native) disulfide bond- 

ied structure relative to the unfolded (reduced) structure depends 
'on both the solution oxidation-reduction potential and the con- 
centrations of other redox-active molecules, it is contemplated 

that the redox potential should be buffered with a redox buffer 
^giving a potential equivalent to a reduced: oxidized glutathione 

ratio of 10- The preferred concentration range of reduced 

■ i 

glutathione would be 0,1-1. 0 mM. At higher concentrations, mixed 

i ■ 

disulfides will form with protein, reducing the yield of the 
refolded (native) structure. The relative stabilities of the 
unfolded protein and the native structure, and thus the rate and 
yield of refolding, will also depend on other solution variables, 
such as the pH, temperature, type of hydrogen-ion buffer, ionic 
strength, and the presence or absence of particular anions or 
cations as discussed in Privalov, P. L., "Stability of Proteins, 
Small Globular Proteins," in Advances in Protein Chemistry, Vol. 
33, pp. 167-236, (1979), specifically incorporated herein by ref- 
erence. These conditions vary for every protein and can be de- 
termined experimentally. It is contemplated that addition of any 
molecule that strongly prefers to bind the native (as opposed to 
the unfolded) structure, and which can be readily separated 
afterwards from the native (refolded) protein, will increase not 
only the yield but the rate of re-folding. These molecules in- " 
elude monoclonal antibodies raised against the native structure, 
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jj and other proteins which tightly bind the native collagenase in- 

hhibitor, such as the mammalian enzyme* collagenase or gelatinase 

,! Example 7 
! j = 

ii The second preferred sequence as set forth herein, i.e., 



1 A 






A A 

40 


50 


6 


GGCCATCGCC 


GCAGATCCAG 


CGCCCAGAGA 


GACACCAGAG 


AACCCACCAT 


GGCCCCCTT 


H F 


XB HH 






N 


S 


& XT 


UT A LI 

fli An 






V. 


A 


X? TT 


Ufi XliA 






o 


U 










1 


1 


/ U 


OU 


Q A 

y u 


1 A A 


1 1 ft 

110 


1 2 


^ x f* r* 
GACCLL rtjUv- 






CjTCjCjLTGATA 


GCCLCAGCAG 


GGCCTGCAC 


B 


SF 






s 


H 


S 


FO 






A 


A 


T 


AK 






U 


E 


1 


11 






1 


3 


130 


140 


150 


160 


170 


18C 


TGTGTCCCAC 


CCCACCCACA 


GACGGCCTTC 


TGCAATTCCG 


ACCTCGTCAT 


CAGGGCCAAC 






•H 


T 


M 


SH 






A 


T 


N 


AA 






E 


H 


L 


UE 






3 


1 


1 


13 



190 



200 



210 



220 



230 



24( 



TTCGTGGGGA CACCAGAAGT CAACCAGACC ACCTTATACC AGCGTTATGA GATCAAGATC 



H 
I 
N 
2 



S 
A 
U 
A 



250 260 270 280 290 30C 

ACCAAGATGT ATAAAGGGTT CCAAGCCTTA GGGGATGCCG CTGACATCCG GTTCGTCTAC 

SD FS FN F H A 

AD OF NS OP C 

UE KA UP K A C 

11 11 12 12 1 



310 320 330 340 350 360 

ACCCCCGCCA TGGAGAGTGT CTGCGGATAC TTCCACAGGT CCCACAACCG CAGCGAGGAG 

N ABM 

C V B N 

O' A V L 

1 2 11 
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,i — - " rtrt 42C 

TTTCTCATTG CTGGAAAACT GCAGGATGGA CTCTTGCACA TCACTACCTG CAGTTTC 

P P H P 

i! 



370 


380 




390 


400 


410 


TTTCTCATTG 


CTGGAAAACT 


GCAGGATGGA 


CTCTTGCACA 


TCACTACCTG 




o 

sr 


F 


H 




p 




s 


0 


I 




s 




T 


K 


N 




T 




1 


1 


1 




1 


430 












GCTCCCTGGA 


AC 











B 
S 
T 
1 

has the following restriction sites: 

# SITES FRAGMENTS FRAGMENTS ENDS 



ACC 1 (GTVWAC) 



AVA 2 (GGRCC) 



BBV 1 (GCTGC) 



BIN 1 (GGATC) 



BST Nl (CCRGG) 



295 295 (68.3) 1 295 

137 (31.7) 295 432 



338 338 (78.2) 1 338 

94 (21.8) 338 432 



350 350 (81.0) 1 350 

82 (19.0) 350 432 



14 418 (96.8) 14 432 

14 ( 3.2) 1 14 



65 360 (83.3) 65 425 

425 65 (15.0) 1 65 

7 ( 1.6) 425 432 



l^w orricc* 

Finnecan. Henderson -fiQ- 
Farabow. Garrett " 0 "" 
& Dunner 

I77S K STHCCT. N W. 
WASHINGTON. C.200Q1 



DDE l-(CTNAG) 



FNU4H 1 (GCNGC) 



# 

1 



SITES 



267 



FRAGMENTS 



FRAGMENTS ENDS 



8 

277 

350 



FOK 1 ( GGATG ) 



76 
273 
285 
384' 



HAE 2 (PGCGCQ) 



19 



267 
165 



269 
82 
73 
8 



197 
99 
76 
48 
12 



413 
19 



(61.8) 
(38.2) 



(62.3) 
(19.0) 
(16.9) 
( 1.9) 



(45.6) 
(22.9) 
(17.6) 
(11.1) 
( 2.8) 



(95.6) 
( 4.4) 



1 

267 



8 

350 
277 
1 



76 
285 

1 

384 
273 



19 
1 



267 
432 



277 
432 
350 
8 



273 
384 
76 
432 
285 



432 

19 



HAE 3 (GGCC) 



1 
51 
111 
144 
174 



258 

60 
50 
33 
30 
1 



(59.7) 
(13.9) 
(11.6) 
( 7.6) 
( 6.9) 
( 0.2) 



174 
51 
1 

111 
144 

1 



432 
111 
51 
144 
174 
1 



HHA 1 (GCGC) 



20 



412 
20 



(95.4) 
( 4.6) 



20 
1 



432 
20 



HINC 2 (GTQPAC) 



199 



233 
199 



(53.9) 
(46.1) 



199 

1 



432 
199 
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-70- 



fafc 



il 

I! HINF 1 (GANTC) 



' HP A 2 (CCGG) 



MNL 1 (CCTC) 



MST 2 (CCTNAGG) 



NCO 1 (CCATGG) 



NSP B2 (CVGCWG) 



PST 1 (CTGCAG) 



SAU 1 (CCTNAGG) 



# SITES FRAGMENTS FRAGMFNTS END* 
1 

389 389 (90.0) 1 389 

43 (10.0) 389 432 



288 288 (66.7) 1 288 

144 (33.3) 288 432 



162 193 (44.7) 162 355 

355 162 (37.5) 1 162 

77 (17.8) 355 432 



266 266 (61.6) 1 266 

166 (38.4) 266 432 



'47 261 (60.4) 47 308 

308 124 (28.7) 308 432 

47 (10.9) 1 47 



278 278 (64.4) 1 278 

154 (35.6) 278 432 



379 379 (87.7) 1 379 

408 29 ( 6.7) 379 408 

24 ( 5.6) 408 432 



266 266 (61.6) 1 266 

166 (38.4) 266 432 
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SAU 3 A- (GATC) 



# SITES 
2 

14 

231 



SAU 9 6 1 (GGNCC) 



51 
110 
173 
338 



SCR Fl (CCNGG) 



65 
425 



SFA Nl (GATGC) 



75 
274 



STY 1 { CCRRGG ) 



47 
308 



TTH111 1 (GACNNNGTC) 



160 



FRAGMENTS 



217 (50.2) 
201 (46.5) 
14 { 3.2) 



165 
94 
63 
59 
51 



272 
160 



(38.2) 
(21.8) 
(14.6) 
(13.7) 
(11.8) 



360 (83.3) 
65 (15.0) 
7 ( 1.6) 



199 (46.1) 
158 (36.6) 
75 (17.4) 



261 (60.4) 
124 (28.7) 
47 (10.9) 



(63.0) 
(37.0) 



FRAGMENTS ENDS 



14 
231 
1 



173 
338 
110 
51 
1 



65 
1 

425 



75 
274 

1 



47 
308 
1 



160 
1 



231 
432 
14 



338 
432 
173 
110 
51 



425 
65 
432 



274 
432 
75 



308 
432 
47 



432 
160 



XHO 2 (PGATCQ) 



13 



419 
13 



(97.0) 
( 3.0) 



13 
1 



432 
13 



The following do not appear: 



AAT 2 
AHA 3 
AVA 1 



AFL 2 
ALU 1 
AVA 3 



AFL 3 
APA 1 
AVR 2 



AHA 2 
ASU 2 
BAL 1 
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1 1 



BAM nl 


DihSi 


i 


BbL 1 




0 

& 


« ecu 1 


DO 1 




ECO Rl 




PR 


HAE x 


rlvsA 


*l 

X 


HGI Dx 




(J Z 


TTT^TT T 

HFri 1 




1 

-L 


Mb i 1 


watt 

ft A Hi 


1 
X 






1 
X 


WOr l-X 


PVTI 

i V \J 


1 


RSA 1 


oAU 


1 


SCA 1 


SMA 


1 


SPE 1 


SPH 


1 


TAQ 1 


XBA 


1 


XMN 1 







RAM 9 




1 


pew 1 
con x 


Dor 




v*r rv. x 




X 


PWTtn 0 

r N\ju z 






nvj I Ax 


no i 




uTMn q 


HPA 


1 

X 




Mt TT 


T 
X 


MAP 1 


Mr* t 


X 


Di\J 1 X 


MPTT 


1 

X 


Jr VU ^ 


DPTT 


X 


SAC 2 


SAL 


1 


SNA 1 


SNA 


Bl 


SSP 1 


STU 


1 


XHO 1 


XMA 


3 



The salient features of this cDNA are: 



= II; 



1. The coding strand is presented in the 5* to 
3 1 convention with the polyC tract at the 5* 
end. 

2. If the first G in the sequence GGC CAT CGC 
CGC is considered as nucleotide 1, then an 
open reading frame exists from nucleotide 1 
through nucleotide 432, which is the 3 T end 
of this partial cDNA. 

3. The first methionine in this reading frame is 
encoded by nucleotides 49 through 51 and 
represents • the initiation site of transla- 
tion. 

4. The amino acid sequence prescribed by 
nucleotides 49 through 114 is not found in 
the primary structure of the mature protein, 
but it is the sequence of the leader peptide 
of human protein* 

5. The sequence of nucleotides 82 through 432 is 
identical to the sequence of nucleotides 
numbered 1 through 351 in the insert from 
the first preferred sequence of Example 1. 

6. The amino acid sequence of the mature protein 
displays two consensus sequences for sugar 
attachment. These sequences, -N-Q-T- pre- 
scribed by nucleotides 202 through 210 and 
-N-R-S- prescribed by nucleotides 346 
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through 354, are amino acid residues 30 
through 32 and 78 through 80, respectively, 
in the mature protein* Both sites are 
glycosylated in the human inhibitor protein. 

It will be apparent to those skilled in the art that varic 

modifications and variations can be made in the processes and 

products of the present invention* Thus, it is intended that t 

present invention cover the modifications and variations of thi 

invention provided they come within the scope of the appended 

claims and their equivalence* 



4 
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